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Summary 

This  research  is  concerned  with  investigating  methods  for  the  control  of  McKibben 
pneumatic  actuators,  or  pneumatic  muscles  (PMs).  PMs  are  a  novel  type  of  actuator  that 
closely  mimic  human  skeletal  muscles  in  size  and  power  capabilities.  PMs  are 
considered  by  the  Air  Force  for  use  in  exoskeletons  to  be  worn  by  humans  for  strength 
augmentation  and  rehabilitation  after  injury  or  illness,  and  for  use  as  actuators  in  robotic 
systems.  The  control  of  PMs  is  a  challenging  problem  due  to  their  highly  nonlinear  and 
time-varying  nature.  In  this  research,  we  investigate  adaptive,  sliding  mode,  and  soft 
computing  approaches  to  control  of  PMs  and  robotic  systems  actuated  by  PMs.  The  soft 
computing  approaches  include  neuro-fuzzy  modeling  of  an  actual  PM  in  the  Human 
Effectiveness  Lab  at  Wright  Patterson  Air  Force  Base,  and  evolutionary  design  of  a  fuzzy 
PID  controller  based  on  this  model.  We  also  investigate  a  type  of  MIMO  fuzzy  model 
predictive  control  for  a  planar  arm  actuated  by  four  PMs.  Some  of  the  controllers  are 
tested  on  the  actual  PM  at  WPAFB  while  others  are  proven  in  simulations.  A  byproduct 
of  this  research  is  an  evolutionary  fuzzy  training  algorithm  useful  for  identification  of 
d)mamical  systems  as  well  as  classification  problems. 
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1 


Objectives 


1.1  Introduction 

McKibben  pneumatic  actuators,  or  pnemnatic  muscles  (PM),  are  a  novel  type  of 
pneumatic  actuator  consisting  of  a  cylindrical,  flexible  rubber  or  plastic  airtight  tube 
inside  a  braided  plastic  sheath  (see  Figure  2.1).  When  the  tube  is  inflated  it  widens  and, 
due  to  the  braided  sheath,  shortens.  When  it  shortens,  the  contractile  force  exerted  is 
quite  large  in  proportion  to  the  PM's  weight.  Pneumatic  muscles  have  the  highest 
power/weight  ratio  (IkW/kg  [1])  and  power/volume  ratio  (IW/cm’  [2])  of  any  actuator. 
They  are  roughly  the  same  as  human  skeletal  muscles  in  size,  shape,  and  power  output. 
A  significant  advantage  of  PM  actuators  is  the  ability  to  make  them  autonomous.  They 
can  be  energized  from  a  small  canister  of  gas  that  can  rapidly  create,  from  a  chemical 
reaction,  large  pressures  for  inflation  of  the  muscle. 

PMs  have  been  used  for  years  in  robotics  to  perform  precision  manipulation  tasks 
([3]-[ll]),  They  can  also  be  used  to  actuate  an  exoskeleton  frame  worn  by  humans  to 
enhance  strength  and/or  mobility.  Concepts  developed  from  our  research  can  be  used  to 
help  the  disabled  obtain  enhanced  strength  and  mobility.  Such  people  have  suffered  from 
stroke,  accidents,  or  other  problems  to  reduce  their  physical  capabilities. 

In  order  for  PMs  to  be  used  for  precision  robotics  or  for  exoskeleton  actuation,  it  is 
necessary  to  be  able  to  control  them  precisely.  Since  they  are  highly  nonlinear  and  time 
varying,  their  control  is  a  challenging  problem.  Our  research  studies  the  closed-loop 
control  of  PM  systems  for  accurate  position  control.  Position  control  for  PMs  refers  to 
the  control  of  their  length  under  varying  loads.  When  actuating  robots  or  exoskeletons, 
this  translates  into  accurate  control  of  joint  angles  or  end-effector  spatial  positioning. 

There  have  been  several  investigations  into  applications  of  PMs  and  their  properties 
([6]-[16]).  Pneumatic  muscle  research  is  ongoing  at  the  Human  Sensory  Feedback  (HSF) 
Laboratory  at  Wright  Patterson  Air  Force  Base  (WPAFB)  ([6],  [7]).  The  HSF  Lab 
contains  a  PM  test  station  that  consists  of  several  PMs,  sensors,  actuators,  and 
instmmentation  to  control  the  PMs’  operation.  In  our  research,  we  have  interacted  to  a 
great  extent  with  the  personnel  at  WPAFB,  testing  our  controllers  on  the  PMs  in  their  lab. 
We  are  indebted  to  the  personnel  in  this  lab  for  their  help  in  performing  our  research. 


1.2  Objectives  and  Summary  of  Research  on  This  Project 

In  this  research,  we  are  concerned  with  PM  position  control  when  used  in 
configurations  conunon  to  anthropomorphic  robotic  systems.  Since  the  PM  is  nonlinear 
and  time- varying,  all  controllers  considered  in  this  research  were  chosen  because  they  are 
known  to  exhibit  robustness  to  parameter  uncertainties.  The  progression  of  the  research 
on  this  project  is  as  follows. 
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Chapter  2;  As  a  first  step,  we  are  concerned  with  obtaining  accurate  mathematical 
models  for  the  PM.  In  this  project  we  have  utilized  two  similar  models,  both  of  which 
were  derived  by  the  researchers  at  WPAFB.  The  models  are  second-order  differential 
equations  with  nonlinear  coefficients.  These  models  have  aided  us  inuneasurably  in  our 
derivation  of  controllers  and  in  simulating  the  closed-loop  behavior  of  the  PMs. 

Chapter  3:  We  consider  nonlinear  adaptive  control  of  PMs  actuating  a  planar  arm.  The 
PMs  act  individually  in  bicep  and  tricep  configurations.  The  control  objective  is  elbow 
angle  tracking.  Simulations  of  the  controlled  system  are  presented. 

Chapter  4:  We  consider  single-input  sliding  mode  control  of  a  planar  arm  actuated  by 
two  PMs  acting  simultaneously  in  opposing  pair  configuration,  similar  to  human 
bicep/tricep  interaction.  The  control  objective  is  elbow  angle  tracking.  Simulation 
results  are  presented. 

Chapter  5:  We  consider  two-input  sliding  mode  control  of  a  planar  arm  actuated  by  four 
PM  groups  acting  simultaneously  in  opposing  pair  configuration.  The  actuator  system 
consists  of  two  agonist/antagonist  pairs  -  one  pair  actuating  the  shoulder,  and  another 
pair  actuating  the  elbow.  The  control  objective  is  spatial  end-effector  (hand)  tracking. 
Simulation  results  are  presented. 

Chapter  6:  We  derive  a  recurrent  neuro-fiizzy  model  of  a  single  PM  hanging  vertically 
actuating  a  mass.  The  model  is  derived  fi'om  data  taken  in  the  HSF  lab  at  WPAFB.  This 
model  is  used  in  Chapter  9  to  derive  a  very  effective  fiizzy  controller  for  the  PM  in  the 
HSF  lab. 

Chapter  7:  We  present  a  new  method  for  fiizzy  classification  that  is  a  byproduct  of  our 
research  on  this  project.  The  method  utilizes  the  VISIT  algorithm  (pioneered  by  the  PI 
and  coworkers)  and  proposes  an  evolutionary  method  of  tuning  the  parameters  for 
optimal  learning. 

Chapter  8:  We  apply  the  Fuzzy  Model  Reference  Learning  Control  (FMRLC) 
methodology  to  PM  control.  Results  from  applying  the  controller  to  the  PM  in  the  HSF 
lab  at  WPAFB  are  presented. 

Chapter  9:  We  derive  a  fiizzy  P+ID  controller  for  the  PM  in  the  HSF  lab  at  WPAFB. 
The  neuro-fiizzy  PM  model  derived  in  Chapter  6  is  utilized,  and  evolutionary  tuning  of 
controller  parameters  is  implemented.  Experimental  results  are  presented. 

Chapter  10;  We  derive  a  fiizzy  model  predictive  controller  for  a  planar  arm  actuated  by 
four  PM  groups  in  opposing  pairs,  as  in  Chapter  5.  Simulation  results  are  presented. 
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2  PM  Differential  Equation  Models  used  in  this  Research 


2.1  Introduction 

Attempts  have  been  made  to  model  PMs  in  various  ways,  including  fuzzy  systems 
and  neural  networks.  In  order  to  design  adaptive  and  sliding  mode  controllers,  a 
differential  equation  model  of  the  PM  is  necessary.  Obtaining  a  DE  model  of  the  PM 
from  first  principles  of  physics  is  difficult,  due  to  the  way  the  PM  is  constructed  (see 
Figure  2.1). 


Gas  inlet 
and  outlet 


Attachment 

cable 


Cylindrical 
rubber  tube 


Attachment 

cable 


Sheath  encloses 
rubber  tube 


Cylindrical 
braided  sheath 


Figure  2.1-  Construction  of  pneumatic  muscle 


Alternatively,  a  DE  model  can  be  derived  for  a  particular  PM  from  ejqperimentation 
in  the  lab.  The  Human  Sensory  Feedback  laboratory  at  WPAFB  contains  an 
experimental  PM  setup  that  can  be  used  to  take  sufficient  measurements  to  derive  models 
for  particular  PMs.  This  has  been  done  in  two  separate  studies  ([7],  [17]),  both  resulting 
in  nonlinear  second-order  ODE  models.  Historically,  the  first  of  these  was  done  by 
Repperger  et  al.  [7].  In  this  model,  the  coefficients  are  calculated  as  nonlinear  functions 
of  the  PM  length  Recently,  a  similar  model,  due  to  Reynolds  et  al.  [17]  was  derived  in 
which  the  coefficients  depend  nonlinearly  on  the  PM  internal  pressure.  In  both  models. 
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the  coefficients  depend  on  whether  the  PM  is  being  inflated  or  deflated.  Both  models 
assume  an  equivalent  structure  for  the  PM  of  a  parallel  connection  of  a  nonlinear  spring, 
a  nonlinear  viscous  friction,  and  a  contractile  element  (Figure  2.2). 


Figure  2.2  —  Three-element  model  for  PM 


2.2  Reppei^er  Model  [7] 

The  particular  PM  modeled  in  [7,  17]  has  an  inner  bladder  made  from  a  section  of 

22.2  mm  diameter  bicycle  tubing  enclosed  in  a  helically-wound  nylon  sheath  used  for 
supporting  electrical  cables.  The  unstretched,  uncompressed  diameter  of  the  sheath  is 
31.75  mm.  In  the  Repperger  model,  the  coefficient  K{x)  is  a  nonlinear  function  of  the 
PM  length  x,  and  B{x)  is  a  nonlinear  function  of  the  PM  rate  of  change  of  length.  The 
muscle  is  inflated  (hence  shortened)  by  opening  a  solenoid  that  controls  the  flow  of 
pressurized  gas  into  the  rubber  bladder.  It  is  deflated  by  opening  another  solenoid 
venting  the  contents  of  the  bladder  to  the  atmosphere. 

Figure  2.3  shows  a  pneumatic  muscle  being  inflated  and  lifting  a  mass.  Let  the 
position  of  the  mass  when  the  PM  is  uninflated  be  defined  as  x  =  0.  If  x(t)  is  the 
vertical  position  of  the  mass,  the  differential  equation  describing  the  system  of  Figure  2.3 
is 


Mx+B(x)x  +  K(x')x  =  F-Mg  (2.2.1) 

where  g  is  the  acceleration  of  gravity.  The  coefficients  B{x)  and  K(x)  depend  on 
whether 
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Figure  2.3  -  PM  hanging  vertically  lifting  a  mass 


the  PM  is  being  inflated  or  deflated  and  are  defined  as  [7] 


Inflation: 


Deflation: 


R.(x)  =  0.04x^+1.3x  +  12.6 
(x)  =  1. 6x2 +10.9X  + 27.1 

(2.2.2) 

{Bj{x)  =  QA2x^  +  2.49X  + 14.48 

1  Kj  (x)  =  3.6x^  +  20.7x  +  47.23 

(2.2.3) 

In  (2.2.1),  the  system  input  F  is  the  upward  force  exerted  on  the  mass  by  the  PM. 
It  is  an  independent  control  variable  that  can  be  externally  commanded  by  adjusting  the 
PM  infernal  pressure.  The  force  exerted  by  the  viscous  fi-iction  action  of  the  PM  is  given 
by  -  B{x)x ,  and  the  force  due  to  the  spring  action  of  the  PM  is  given  by  -  K{x)x . 


2.3  Reynolds  Model  [17] 

The  PM  is  again  modeled  by  the  3-element  mechanical  model  shown  in  Fig.  2.2, 
consisting  of  a  contractile  element,  dashpot,  and  spring.  In  the  Reynolds  model,  these 
three  elements  all  have  pressure-dependent  coefficients.  It  is  shown  in  [17]  that  the 
system  of  Figure  2.3  can  be  modeled  as 

Mx+B{P)x  +  K(p)x=-F(p)-Mg  (2.3.1) 

where  M  is  the  mass  lifted  by  the  PM,  B  is  the  coefficient  of  viscous  friction,  K  is  the 
spring  coefficient,  F  is  the  force  exerted  by  the  contractile  element,  and  g  is  the 
acceleration  of  the  gravity. 


According  to  [17],  B,  K  and  F  for  the  specific  PM  considered  are  dependent  on  the 
internal  pressure  of  the  PM  and  are  given  in  SI  units  as: 

P  =  179.2 +  1.39P 

(2.3.2a) 

.^:  =  5.71  +  0.0307P 

(2.3.2b) 

f  1. 01 +  0.00691P  (Inflation) 

"  |o.6  -  0.000803P  (Deflation) 

(2.3.2c) 

where  P  is  the  PM  supply  pressure  in  kPa.  The  coefficients  specified  in  (2.3.2)  are  valid 
in  the  range  206.844  <P<  620.532  kPa  (30  <  P  <  90  psi) .  Note  that  the  coeflScients  B  in 
deflation  are  smaller  than  in  inflation.  The  reason  is  because  in  deflation  the  PM  system 
vents  against  a  constant  atmospheric  pressure.  During  inflation,  however,  the  pressure 
buildup  is  in  a  closed  volume  and  the  forcing  function  has  to  fight  against  the  increasing 
PM  internal  pressure  as  the  PM  inflates. 

Note  from  (2.3.2)  that  the  contractile  force  F,  viscous  damping  coefficient  B  and 
spring  coefficient  K  are  functions  of  P,  which  is  the  control  variable,  le.  P  is  the 
independent  variable  that  can  be  commanded  by  the  controller.  Therefore,  in  this  model 
the  control  variable  enters  the  equations  of  motion  via  the  coefficients  B,  K,  and  F,  a 
situation  that  is  different  from  most  conventional  control  problems. 
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3  Adaptive  Control  of  a  Planar  Arm  Actuated  by  PMs  in 
Bicep  and  Tricep  Configurations 


3.1  Introduction 

In  the  adaptive  control  approach  ([18],  [19]),  we  utilize  the  Repperger  model  of  the 
PM  (2.2.1-2.2.3)  [7].  In  nonlinear  adaptive  control,  parts  of  the  model  are  assumed 
known,  and  their  multiplying  coefficients  are  treated  as  unknown,  with  adaptive  laws  for 
driving  the  parameter  estimates  to  their  true  values.  The  strengths  of  the  adaptive 
approach  are  robustness  to  parameter  uncertainties  together  with  ease  of  implementation. 
A  weakness  is  the  necessity  of  knowing  the  general  form  of  the  plant  model. 


3.2  Dynamic  Modeling  of  Limbs  with  PM  in  Bicep  and  Tricep  Positions 

The  two  basic  configurations  in  which  the  PM  can  be  arranged  for  use  in 
exoskeletons  are  the  bicep-type  (Figure  3.1)  and  tricep-type  (Figure  3.2)  configurations. 
In  this  chapter,  the  control  problem  for  both  configurations  is  precise  control  of  the  joint 
angle  of  a  limb  which  is  holding  a  mass.  Specifically,  we  wish  to  actuate  the  PM  by 
inflating  and  deflating  it  in  such  a  way  that  the  joint  angle  follows  a  reference  function  of 
time  while  the  limb  holds  a  mass. 

PM  in  Bicep  Configuration 

Consider  the  limb  configuration  shown  in  Figure  3.1,  which  depicts  an  arm  lifting  a 
mass,  with  the  PM  in  the  position  of  a  bicep.  The  upper  arm  remains  stationary  as  the 
PM  expands  and  contracts,  moving  the  forearm.  The  upper  end  of  the  PM  and  upper  arm 
are  attached  to  a  motionless  reference  point.  The  mass  M  is  held  at  the  end  of  the 
forearm.  The  forearm,  which  is  considered  massless,  is  attached  to  the  upper  arm  by  a 
frictionless  joint.  The  PM  is  attached  to  the  forearm  at  point  A ,  which  is  a  distance  a 
from  the  joint.  The  distance  from  the  center  of  mass  of  the  load  to  the  joint  is  L .  The 
forearm  is  free  to  rotate  through  an  angle  0 ,  where  ^  =  0’  corresponds  to  the  arm  being 
fully  bent,  i.e.  the  mass  in  the  extreme  upward  position,  and  0  =  \  80°  corresponds  to  the 
arm  being  fully  straightened,  Le.  the  mass  in  the  extreme  downward  position.  For 
simplicity,  we  will  assume  the  PM  force  always  acts  parallel  to  the  forearm.  This 
assumption  is  valid  so  long  as  0  is  not  close  to  either  of  its  extremes. 

Since  the  upward  force  exerted  by  the  PM  on  the  forearm  at  point  A  is 
F  B{x)x  +  K(x)x ,  the  clockwise  torque  imparted  to  the  forearm  by  the  PM  is 

(F  +  Bix)x + A’(x)x)flsin  0 .  Therefore,  the  system  dynamics  are  described  by: 

-10  =  {F+B(x)x+K(x)x')asm0-MgLsm  0  (3.2.1) 

where  I  =  Ml}  is  the  moment  of  inertia  of  the  mass  about  the  joint,  g  is  the  acceleration 
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of  gravity,  and  MgLsviO  is  the  counterclockwise  torque  imparted  to  the  forearm  by 
gravity.  Then,  using  x  =  -fl(l  +  cos^)  and  x^aOsmO ,  we  can  rewrite  (3.2.1)  entirely 
in  terms  of  ^  as  follows: 


\\\x\\\\\ 


Figure  3.1  -  Planar  arm  with  PM  in  bicep  position  actuating  elbow  joint 


19  =  -Ba^O  sin  ^  ^ + Ka^  sin  0(l + cos  0)+  MgL  sin  0  -  Fa  sin  0  (3 .2.2) 

where  B  and  K  are  now  expressed  in  terms  of  0  and  0 . 

The  external  input  to  the  system  is  F ,  which  is  determined  by  how  much  the  PM  is 
inflated.  Note  that  since  F  is  multiplied  by  sin0  in  (3.2.2),  the  system  becomes 
uncontrollable  at  ^  =  0  and  at  0  =  180°.  For  this  reason,  joint  angles  should  not 
approach  these  limits.  We  will  see  that  the  tricep  configuration  does  not  have  this 
restriction. 

PM  in  Tricep  Configuration 

Figure  3.2  depicts  an  arm  lifting  a  mass,  with  the  PM  in  the  position  of  a  tricep. 
The  upper  arm  remains  stationary  as  the  PM  ejqpands  and  contracts,  moving  the  forearm. 
The  lower  end  of  the  PM  and  upper  arm  are  attached  to  a  motionless  reference  point.  The 
mass  M  is  held  at  the  end  of  the  forearm.  The  forearm,  which  is  considered  massless,  is 
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attached  to  the  upper  arm  by  a  frictionless  joint.  Also  at  the  joint  is  a  frictionless  pulley 
of  radius  r ,  over  which  a  cable  connecting  the  PM  to  the  forearm  passes.  The  PM  is 
attached  to  the  forearm  at  point  A,  which  is  a  distance  c  from  the  joint.  The  cable  makes 

an  angle  a  =  sin”^(/'/c)  with  the  forearm.  The  distance  from  the  center  of  mass  of  the 

load  to  the  joint  is  L .  The  forearm  is  free  to  rotate  through  an  angle  9,  where  ^  =  0° 
corresponds  to  the  arm  being  frilly  straightened,  i.e.  the  mass  in  the  extreme  upward 
position,  and  ^  =  180°  corresponds  to  the  arm  being  fully  bent,  i.€.  the  mass  in  the 
extreme  downward  position. 

Since  the  downward  force  exerted  by  the  PM  is  F  +  5(i:)x  +  K(x)x ,  the  clockwise 
torque  imparted  to  the  forearm  by  the  PM  is  (F  +  B(x)x + K(x)x)csma .  Therefore,  the 
system  dynamics  are  described  by: 

t 

f 


Figure  3.2  -  Planar  arm  with  PM  in  tricep  position  actuating  elbow  joint 


- 19  =  (F+ B{x)x + K{x)x)csma-MgLsm  9 

=  {F  +  B{x)x+K{x)xy-MgLsm9  (3.2.3) 


where  I  =  Ml}  is  the  moment  of  inertia  of  the  mass  about  the  joint,  g  is  the  acceleration 
of  gravity,  and  MgLsm9  is  the  counterclockwise  torque  imparted  to  the  forearm  by 

gravity.  Then,  using  x  =  -— ;ir(l  +  cos0)  and  x  =  —7tr^m9,  we  can  rewrite  (3.2.3) 

2  2 


entirely  in  terms  of  ^  as  follows: 


/^‘=:_:?^^in^+-^^(l  +  cos^)  +  MgZsin^-rF  (3.2.4) 

2  2 

where  i?  and  A"  are  now  expressed  in  terms  of  ^  and  ^ .  Note  that  the  system  with  PM 
in  tricep  position  is  controllable  for  all  0  because  the  force  exerted  by  the  PM  always 
acts  at  an  angle  a  to  the  forearm  regardless  of  joint  angle. 


3.3  Adaptive  Tracking  for  Limbs  with  PM  in  Bicep  and  Tricep  Positions 

The  mass  M  manipulated  by  the  PM  can  be  expected  to  vary  significantly  from 
use  to  use.  Also,  the  coefficients  B  and  K  will  vary  with  PM  temperature,  and  from  unit 
to  unit.  Also,  the  physical  distances  r ,  a ,  and  i  may  vary  from  unit  to  unit.  Therefore, 
the  bicep  (3.2.2)  and  tricep  (3.2.4)  models  are  poorly  known  and  time-varying,  making 
nonadaptive  control  methods  vulnerable  to  failure  when  used  for  tracking  performance  of 
the  PM-  Since  the  nonlinear  functions  of  9  are  known  in  (3.2.2)  and  (3.2.4)  and  only 
their  coefficients  are  uncertain,  we  utilize  a  method  of  nonlinear  adaptive  tracking  based 
on  sliding  control  ([20],  [21]).  It  uses  a  well-known  result  from  model  reference  adaptive 
control,  which  we  give  without  proof  (see,  e.g.  [20]). 

Lemma:  Consider  two  signals  e  arid  ^  related  by  the  following  dynamic  equation 


e{t)^H{p)[kf{t)v{t)]  (3.3.1) 

■where  e{t)  is  a  scalar  output  signal,  H(p)  is  a  strictly  positive  real  transfer  function,  k  is 
an  unknown  constant  with  a  known  sign,  <p(t)  is  a  mxl  vector  function  of  time,  and  v(f) 
is  a  measurable  mxl  vector.  If  the  vector  ^  varies  according  to 

^(t)  = -sgn(k)j€v(t)  (3.3.2) 

■with  y  being  a  positive  constant,  then  e{t)  and  ^(t)  are  globally  bounded.  Furthermore, 
if  V  is  bounded,  then  e{t)  ->  0  t  ->  oo . 

Bicep  Adaptive  Tracking 

Consider  the  problem  of  the  arm  lifting  a  mass  with  PM  in  bicep  position  as  in 
Figure  3.1.  If  we  substitute  x  =  -a(l  +  cos^)  and  x  =  a9sm0  in  (3.2.2),  we  get  an 
equation  in  the  form 
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hO  +  a\^  sm^  6+ sin^  d + a-},6  sin^  ^  +  04  sin  0(\. + cos 
+a5  sin^(l  +  cos^)^  +flr6sin  ^(1  +  cos^)  +  a7sm0  =  — Fsin  0 


or 


1 

h0  +  ^aifi{0,0)=-Fsm0  (3.3.3) 

1=1 

where  h^I/a,  a, ,...,07  are  parameters  which  depend  on  the  physical  properties  of  the 
system  (i.e.  Af ,  a,  i,  the  coefficients  in  (2.2.2)  and  (2.2.3),  etc.),  and  yj,...,/7  are 
known  functions  of  0  and  0. 

Assume  that  h,  are  unknown  and  it  is  desired  that  the  PM  angle  ^(0 

track  a  known  angle  0^(0-  Define  the  error  e(t)  —  0{t)  —  0j(t).  Also  define  the 
auxiliary  signal 

=  (3.3.4) 

where  Aq  is  a  positive  constant  and  the  combined  error 

s  =  e  +  XQe  (3.3.5) 


Consider  a  control  F  such  that 

7 

-Fsm0  =  hyj.—ks  +  ^^aifi  (3.3.6) 

i=\ 

where  it  is  a  positive  constant  and  h,  are  estimates  of  the  unknown  parameters 

h,a^,...,aT  With  this  control  law,  we  have  the  following  result  concerning  the  stability 
of  asymptotic  tracking  of  the  arm  with  PM  in  bicep  position: 

Theorem  h  Consider  the  PM  in  bicep  position  moving  a  mass  (Figure  3.1). 
Assume  the  PM  spring  and  viscous  fi'iction  coefficients  are  as  in  (2.2.2).  If  the  force  F 
dehvered  by  the  PM  satisfies  (3.3.6),  then  all  signals  of  the  adaptive  system  are  bounded 
with  lim  e(t)  =  0  provided  the  parameter  estimates  are  adjusted  according  to 
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h  =  -}syr 

a,=-ysfi,  /  =  1,...7 

where  y  is  a  positive  constant. 

Proof:  It  can  be  shown  that  the  tracking  error  from  control  law  (3.3.6)  is 

7 

hs  +  ks  =  hyr  +  ^aifi  (3.3.8) 

1=1 


(3.3.7a) 

(3.3.7b) 


where  h=h-h  and  3^  =  -  a, .  This  can  be  rewritten  as 


s  = 


llh 


p  +  k/h 


hyr+Yi^ifi 

i=l 


(3.3.9) 


This  is  in  the  form  of  (3.3.1)  with  the  transfer  function  obviously  being  strictly  positive 
real.  Therefore,  we  have  from  the  lemma  that  all  signals  of  the  adaptive  system  are 
bounded. 

Consider  the  Lyapunov  frinction  candidate 


V  =  hs^ 


7 


(3.3.10) 


where  h=h-h  and  ai=ai-  a/ .  It  is  straightforward  to  show  that  the  derivative  of  V 
along  the  trajectories  of  the  closed-loop  system  is  given  by 

V  =  -2ks^  (3.3.11) 

Therefore,  we  have  j-^Oast— >oo.  It  follows  that  lim  e{t)  =  0 .  0 

Tricep  Adaptive  Tracking 

Consider  the  problem  of  the  arm  moving  a  mass  with  PM  in  tricep  position  as  in 

Figure  3.2.  If  we  substitute  jc  =  -— ;7r(l+cos^)  and  x-^7ir0smd  in  (3.2.4),  we  get  an 

2  2 

equation  in  the  form 
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h6  +  a^  (6  sin  0f  +  0  sin  Of  +  aj^  sin  0  +  04  (1  +  cos  Of 

+ aj  (1  +  cos  0)^  +  (1 + cos  ^) + O7  sin  ^  =  -F 


or 


he+'Za,f(0.e)  =  -F  (3.3.12) 

1=1 

where  h^I/r,  a, , . . . ,  a,  are  parameters  which  depend  on  the  physical  properties  of  the 
system  (i.e.  M,a,L,  the  coefficients  in  (2.2.2)  and  (2.2.3),  etc.),  and  are 

known  fimctions  of  0  and  0 . 

Assume  that  h,a^,...,a.j  are  unknown  and  it  is  desired  that  the  PM  angle  0{f) 
track  a  known  angle  0a{t).  Define  the  quantities  e{f)  =  0{f)-0d{f),  yr(0>  ^nd  s  as 
above. 

Consider  a  control  F  such  that 


7 

-F  =  +  (3.3.13) 

/=! 

where  k  is  a  positive  constant  and  h,a^,...,a^  are  estimates  of  the  unknown  parameters 
h,a^,...aT.  Then  we  have  the  following  result  concerning  the  stability  of  asymptotic 
tracking  of  the  arm  with  PM  in  tricep  position  using  the  control  (3.3.13): 

Theorem  2:  Consider  the  PM  in  tricep  position  lifting  a  mass  (Figure  3.2).  Assume 
the  PM  spring  and  viscous  friction  coefiBcients  are  as  in  (2.2.2).  If  the  force  F  delivered 
by  the  PM  satisfies  (3.3.13),  then  all  signals  of  the  closed-loop  sj^tem  are  bounded  with 
lim  e(t)  =  0  provided  the  parameter  estimates  are  adjusted  according  to 

f-^00 


Cl: 


h  =  -pyr 

~l^fi »  /  =  1, . .  •  ,7 


(3.3.14a) 

(3.3.14b) 


where  y  is  a  positive  constant. 

Proof  :  The  proof  is  similar  to  that  of  Theorem  1. 

Comment  1:  To  implement  the  control  laws  (3.3.6)  and  (3.3.13)  it  is  necessary  to  measure 
^  and  This  should  be  no  problem  in  PM  applications,  because  these  are  the  joint 
angle  and  its  rate  of  change,  respectively  and  are  easily  measured. 
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Comment  2:  As  stated  above,  the  bicep  control  F  is  multiplied  by  sin0 ,  which  vanishes 
as  the  arm  approaches  either  the  vertical-up  or  vertical-down  position.  Thus,  the  arm 
cannot  be  controlled  in  the  vicinity  of  these  positions.  For  this  reason,  care  should  be 
taken  to  avoid  arm  angles  close  to  vertical-up  or  vertical-down  for  bicep  control.  Tricep 
configuration  does  not  have  this  limitation  due  to  the  fact  that  the  force  is  always  applied 
at  an  angle  a  to  the  forearm,  regardless  of  the  joint  angle. 

Comment  3:  The  assumption  of  coefficients  (2.2.2)  are  necessary  so  that  the  plant 
parameters  are  constants.  This  assumption  is  equivalent  to  stipulating  that  the  PM  is  not 
allowed  to  deflate.  This  may  be  the  case  if  e.g.  the  task  is  to  lift  a  mass.  If  the  PM  were 
inflating  and  deflating,  the  PM  spring  and  viscous  friction  coefficients  would  switch 
between  (2.2.2)  and  (2.2.3).  Therefore,  the  plant  parameters  would  be  time  varying  and 
the  proofs  to  Theorems  1  and  2  would  be  considerably  more  difficult.  The  assumptions 
are  technically  convenient  to  prove  the  theorems,  but  do  not  appear  to  be  necessary  for 
asymptotic  tracking  in  simulations  in  which  both  inflation  and  deflation  are  involved.  Of 
course,  the  theorems  also  apply  to  the  case  where  only  coefficients  (2.2.3)  are  assumed. 
In  this  case  the  PM  is  not  allowed  to  inflate  but  only  deflate,  which  might  be  the  case 
when  the  task  is  to  lower  the  mass. 


3.4  Simulation  Results 

The  systems  of  (3.2.2)  and  (3.2.4)  were  simulated  using  a  4th-order  Runge-Kutta 
algorithm  with  a  step  size  of  0.01  seconds.  The  control  for  the  bicep  configuration  is 
given  by  (3.3.6)  and  (3.3.7).  The  control  for  the  tricep  configuration  is  given  by  (3.3.13) 
and  (3.3.14).  The  results  of  these  simulations  are  given  below. 

Bicep  Simulation 

Assume  a  configuration  as  in  Figure  3.1.  Let  Z  =  0.5m,  a  =  0.025  m,  M  =  50kg, 
and  g  =  9.807m/sec^  .  Since  «  =  0.025  m,  the  full  travel  of  the  forearm  from  ^  =  180° 

(arm  fully  straightened)  to  ^  =  0°  (arm  fully  bent)  corresponds  to  a  maximum  change  in 
length  of  the  PM  of  0.05  m.  This  corresponds  approximately  to  the  actual  capability  of 
the  PM  considered  above. 

The  desired  trajectory  for  the  joint  angle  between  0  and  15  seconds  is 


^^(O  =  [60°+62.5“(sin(2;z/iO  +  sin(2;^2O)J  (3-4.1) 

with  fi  =0.01  Hz  and  /2  =0.1  Hz.  Therefore,  0j(t)  is  a  sum  of  two  sinusoids  with 

initial  condition  ^^(0)  =  60° .  This  trajectory  spans  monotonically  increasing  joint  angles 

from  60°  to  approximately  110°  and  corresponds  to  the  arm  lifting  a  along  the 
prescribed  trajectory. 
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The  input  to  the  PM  is  given  by  (3.3.6)  and  (3.3.7)  with  y=3xl0^,  ^=1,  and 

Jt  =  1 .  The  initial  parameter  guesses  are  zero,  and  the  initial  joint  angle  is  42° .  The 
desired  and  actual  Joint  angles  are  shown  in  Figure  3.3.  It  is  seen  that  the  arm 
asymptotically  tracks  after  the  initial  adaptation  stage  (approximately  4  seconds). 


Figure  3.3  -  Bicep  adaptive  tracking,  y  =  3xl0'*,  Ao=l,  k  =  l. 


Tricep  Simulation 

Assume  a  configuration  as  in  Figure  3.2.  Let  Z  =  0.5m,  r=(0.05/?r)m, 
M  =  50  kg,  and  g  =  9.807ni/sec^ .  Since  r  =  (0.05/;r)ni,  the  full  travel  of  the  forearm 

fi-om  0  =  180°  (arm  fully  bent)  to  0  =  0°  (arm  fully  straightened)  corresponds  to  a 
maximum  change  in  length  of  the  PM  of  0.05  m. 

The  desired  trajectory  for  the  joint  angle  is  again  as  in  (3.4.1),  which  corresponds  to 
lowering  the  mass  along  the  prescribed  trajectory.  The  input  to  the  PM  is  given  by 

(3.3.13)  and  (3.3.14)  with  y  =  1x10^,  =1 »  ^  =  1  •  Th®  parameter  guesses 

are  zero,  and  the  initial  joint  angle  is  42° .  The  desired  and  actual  joint  angles  are  shown 
in  Figure  3.4.  Again  we  have  asymptotic  tracking  except  in  the  initial  adaptation  stage. 
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Figure  3.4-  Tricep  adaptive  tracking,  /  =  lxlO^,  Aq=1,  k  =  \. 


3.5  Discussion 

The  simulations  of  PM  in  bicep  and  tricep  configurations  have  been  designed  to 
closely  conform  to  use  in  PM-actuated  exoskeletons.  In  an  exoskeleton,  there  are  no 
rigid  rods  for  forearm  and  upper  arm,  but  the  exoskeleton  may  possess  some  form  of 
rigidity,  i.e.  a  rigid  enclosure  for  a  human  limb.  Exoskeleton  PMs  are  arranged  in 
configurations  very  similar  to  human  skeletal  muscles,  i.e.  agonist/antagonist  or 
bicep/tricep  pairs.  The  bicep  and  tricep  results  in  this  paper  apply  to  PMs  used  anywhere 
in  an  exoskeleton  (arms,  legs,  etc.),  as  long  as  they  are  arranged  in  bicep  or  tricep 
configurations.  An  exoskeleton  PM  in  the  tricep  configuration  must  have  a  path  over 
which  the  PM  cable  passes  to  attach  to  the  limb  past  the  joint.  This  path  has  been 
modeled  as  a  fi'ictionless  pulley  in  this  study. 

In  typical  exoskeleton  applications,  the  mass  actuated  by  the  PM,  or  the  moment  of 
inertia  of  the  moving  joint,  will  vary  significantly  due  to  changing  joint  angles.  For 
instance,  when  moving  a  mass  fi-om  one  point  to  another,  the  arm  may  bend,  changing  the 
load  to  the  PM,  which  nevertheless  must  actuate  the  limb  to  follow  a  desired  reference 
trajectory.  This  situation  arises  in  robotics  as  well.  Also,  the  nonlinear  spring  and 
nonlinear  viscous  damping  coefficients  are  poorly  known  and  change  with  time.  This  is 
because  with  use,  the  PM  heats  up,  changing  these  coefficients.  In  addition,  physical 
properties  of  the  exoskeleton,  i.e.  arm  lengths,  distances  to  attach  points,  etc.  may  be 
poorly  known. 

Therefore,  adaptive  control  methods  have  been  applied  to  this  problem,  since  fixed 
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controllers  are  less  robust  to  parameter  changes  than  adaptive  ones.  The  simulations  in 
Section  3.4  were  also  carried  out  with  a  PID  controller  designed  to  give  good 
performance  with  M  =  50  kg.  If  the  mass  remains  in  the  vicinity  of  this  value,  the  PID 
gives  good  results.  However,  if  the  mass  changes  significantly,  the  fixed  PID  cannot 
stabilize  the  system.  With  the  adaptive  controllers  given  in  Section  3.3,  M  can  undergo 
a  threefold  change  while  retaining  adequate  tracking.  However,  the  fixed  PID  is  much 
less  tolerant  to  changes  in  M ,  failing  to  stabilize  the  i^stem  for  M  >  80  kg. 

Finally,  we  note  that  in  real  applications  of  PMs,  they  will  most  probably  be 
arranged  in  agonist/antagonist  pairs,  as  in  [4].  Therefore,  there  will  be  a  bicep/tricep  pair 
rather  than  a  single  bicep  or  tricep  acting  alone.  This  would  increase  joint  impedance  and 
result  in  a  more  stable  joint  angle  control  problem.  The  present  paper  is  intended  to  study 
the  action  of  individual  muscles  only,  without  introducing  agonist/antagonist  interaction. 


3.6  Conclusions 

Dyn^iiic  models  for  pneumatic  muscles  in  bicep  and  tricep  configurations 
actuating  a  mass  have  been  derived.  These  configurations  are  very  similar  to  exoskeleton 
applications  in  which  PMs  are  used  to  increase  strength  and  mobility  in  humans.  The 
models  are  second-order  and  nonlinear  in  the  joint  angle.  Their  form  makes  them 
amenable  to  nonlinear  adaptive  control  techniques,  since  the  nonlinear  functions  of  the 
joint  angle  are  known,  with  only  physical  constants  of  the  system  being  unknown. 
Simulations  of  closed-loop  adaptive  tracking  of  limbs  moving  masses  show  that  adaptive 
control  techniques  are  superior  to  fixed  methods,  i.e.  fixed  PID  controllers,  for  this 
application. 
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4 


Sliding  Mode  Control  of  Planar  Arm  with  Two  PMs 


4.1  Introduction 

Sliding  control  is  a  very  promising  method  of  PM  control.  It  has  the  advantage  that 
it  can  provide  accurate  tracking  with  bounded  error  in  the  presence  of  model  uncertainties. 
A  disadvantage  is  that  it  can  produce  a  high  amount  of  control  effort  due  to  chattering. 
This  is  because  the  control  law  is  discontinuous  across  the  sliding  surface.  For  this  reason, 
a  boundary  layer  is  usually  designed  into  the  control  law  in  which  the  control  is  linear  in 
the  vicinity  of  the  sliding  surface.  Sliding  mode  control  is  ideal  for  PM  control  because 
the  PM  model  is  usually  poorly  known,  nonlinear,  and  time-vaiying,  necessitating  some 
type  of  robust  control  strategy.  This  accounts  for  the  success  of  adaptive,  variable- 
structure,  and  soft  computing  approaches  also. 

In  this  chapter,  the  control  problem  is  elbow  angle  tracking  for  a  planar  arm.  The 
elbow  is  actuated  by  a  pair  of  opposing  PMs,  one  in  bicep  and  one  in  tricep  position.  We 
first  derive  a  mathematical  model  of  the  arm  with  antagonistic  PM  actuators,  then 
formulate  the  sliding  mode  controller  to  produce  accurate  tracking  of  the  elbow  angle. 
We  address  the  inportant  problem  of  bicep/tricep  static  internal  pressures  producing 
stable  arm  motion  when  control  is  absent.  Finally,  we  present  the  results  of  computer 
simulations  of  the  arm  under  the  conditions  of  different  actual  arms  (hence  different  truth 
models),  and  temperature  variations. 


4.2  Planar  Ann  Dynamic  Model 

In  this  chapter,  the  PM  is  modeled  with  the  Reynolds  model  (2.3.1),  (2.3.2).  From 
(2.3.1),  the  total  force  exerted  by  the  PM  on  the  mass  is  F  —  Bx  —  Kx.  The  internal 
pressure  P  of  the  PM  is  an  independent  control  variable  that  can  be  externally  commanded 
by  adjusting  the  inflation  and  deflation  solenoids.  If  several  PMs  are  present,  each  one 
has  its  own  F,  K,  and  B  coefficients,  its  own  internal  pressure  P,  and  its  own  inflation  or 
deflation  status.  In  this  chapter,  we  assume  all  PMs  are  identical  so  their  coefficients  are 
the  same. 

Consider  the  planar  manipulator  configuration  shown  in  Figure  4. 1 ,  which  depicts  an 
arm  actuating  a  mass  with  PMs  in  bicep  and  tricep  positions.  The  upper  arm  remains 
stationary  as  the  PMs  expand  and  contract,  moving  the  forearm.  The  upper  ends  of  the 
PMs  and  upper  arm  are  attached  to  a  stationary  reference  point.  A  mass  M  is  held  at  the 
end  of  the  forearm.  The  forearm,  which  is  considered  massless  without  loss  of  generality, 
is  attached  to  the  upper  arm  by  a  fiictionless  revolute  joint,  or  "elbow."  The  PMs  are 
attached  to  the  forearm  at  point  A,  which  is  a 
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Figure  4.1  -  Planar  arm  with  PMs  in  bicep/tricep  positions 

distance  a  from  the  joint's  axis  of  rotation.  The  distance  from  the  center  of  mass  of  the 
load  to  the  joint  is  L.  Also  at  the  joint  is  a  frictionless  pulley  of  radius  r,  over  which  a 
cable  connecting  the  tricep  PM  with  the  forearm  passes.  The  tricep  cable  makes  an  angle 
a  =  sin“^  (r/a)  with  the  forearm.  The  forearm  is  free  to  rotate  through  an  angle  6,  where 
0  =  0°  corresponds  to  the  arm  being  folly  straightened,  i.e.  the  mass  in  the  extreme 
downward  position,  and  6  =  180°  corresponds  to  the  arm  being  fully  bent,  i.e.  the  mass  in 
the  extreme  upward  position.  If  the  bicep  PM  contraction  is  and  the  tricep  PM 
contraction  is  Xt,  we  have  =  a(l  —  cosO)  and  Xt  =  a(l  4-  cos0). 

Let  the  bicep  PM  internal  pressure  be  Pj,  and  the  tricep  PM  internal  pressure  be  Pt. 
Also  let  the  bicep  PM  coefficients  be  Ft,,  Bb,  Ki,  and  the  tricep  PM  coefficients  be  Ft,  Bt, 
Kf  Then,  from  (2.3.1)  the  clockwise  torque  exerted  by  the  bicep  PM  on  the  fi)rearm  is 
(FbiPb)  —  Kb{Pb)xb  —  jB6(Pfr)ifr)asin0  and  the  counterclockwise  torque  exerted  by  the 
tricep  PM  is  (Pt(Pt)  —  Kt{Pt)xt  —  Bt{Pt)xt)r.  The  equations  of  motion  of  the  system 
of  Figure  4.1  can  be  derived  by  summing  torques  about  the  elbow  [22]: 


19  =  {Fb  —  KbXb  —  BbXb)asm9  —  (Ft  —  KtXt  —  BtXt)r  —  MgLsmB  (4.2.1) 
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where  I  =  ML^  is  the  moment  of  inertia  of  the  mass  about  the  elbow,  g  is  the 
acceleration  of  gravity,  and  MgLsmO  is  the  counterclockwise  torque  imparted  to  the 
forearm  by  gravity  acting  on  the  mass.  Note  that  since  the  bicep  force  is  multiplied  by 
asin0,  the  bicep  loses  controllability  at  ^  =  0°  and  $  =  180°.  For  this  reason,  the  arm 
angle  should  be  kept  away  from  these  extremes.  This  situation  could  be  avoided,  e.g.  by 
wrapping  the  bicep  cable  around  the  elbow  pulley  also.  The  tricep  does  not  suffer  this 
drawback  since  its  cable  always  makes  an  angle  of  a  =  sin“^(r/a)with  the  forearm 
regardless  of  9. 

From  (2.3.2),  let  Fi,  =  Fq+  FiPb,  Ft  =  Fq  +  Ki,  =  Kq  +  KiPb, 

Kt  —  Kq  +  KiPt,  Bf,  =  Bq  +  BiPb,  and  Bt  =  Bq  +  BiPt  where  Fq  —  179.2, 
=  1.39,  Kq  =  5.71,  Ki  =  0.0307,  and  Bq,  Bi  depend  on  whether  the  PM  in  question 
is  being  inflated  or  deflated,  as  follows: 


r  1.01,  inflation 
( 0.6,  deflation 


(4.2.2a) 


j  0.00691,  inflation 
(  —  0.000803,  deflation 


(4.2.2b) 


The  internal  bicep  and  tricep  pressures  Pj  and  P*  are  the  control  variables  that  can  be 
independently  commanded  by  the  controller  as  inputs  to  the  system.  Note  that  the  PM 
dynamics  depend  on  whether  the  PM  is  being  inflated  or  deflated. 

Let  us  assume  that  the  bicep  and  tricep  internal  pressures  are  given  by 


Pb  =  Pbo  +  Ap  (4.2.3a) 

Pt  =  Pto-Ap  (4.2.3b) 


where  Pjo  is  a  nominal  constant  internal  bicep  PM  pressure,  P^o  is  a  nominal  constant 
internal  tricep  PM  pressure,  and  Ap  is  an  arbitrary  function  of  time.  With  the  definitions 
in  (4.2.3),  (4.2.1)  can  be  written  as  a  single-input  single-output  system,  with  input  Ap  and 
output  9. 

According  to  (4.2.3),  the  tricep  deflates  when  the  bicep  inflates  and  vice  versa. 
Therefore,  one  set  of  B  parameters  will  apply  to  one  of  the  PMs  while  the  other  set 
applies  to  the  other  PM  at  a  given  time.  When  the  inflation/deflation  status  of  the  PMs 
changes,  they  trade  B  parameters.  We  denote  the  bicep  B  coefficients  as  Po6  and  B^, 
and  the  tricep  B  coefficients  as  Pot  and  P^ . 
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4.3  Bicep  and  Tricep  Nominal  Pressures  for  Desired  Equilibrium  Angle 

In  order  that  the  arm  has  a  stable  equilibrium  at  an  angle  6e,  we  seek  to  balance  the 
steady-state  clockwise  and  counterclockwise  torques  about  the  elbow  by  choice  of  bicep 
and  tricep  nominal  pressures  Pto  and  Pjo-  From  (4.2.1),  the  total  steady-state  clockwise 
torque  is 


Tew  =  [FiPbo  -  {Ko  +  KiPbo)a{l  -  cos0)]asin0  (4.3.1) 


and  the  total  steady-state  counterclockwise  torque  is 


Tcc  =  [PiPto  -  {Kq  +  KiPtQ)a{l  -I-  cos0)]r  -I-  MgLsinO  (4.3.2) 


for  a  constant  angle  9.  Equating  these  torques  at  a  desired  equilibrium  angle  9^  results  in 
the  following  relationship  between  Pbo  and  P^o' 


Pfto  =  mPto  -b  c 


(4.3.3) 


where 


m  =  [Pi  —  Kia{l  +  cos9e)]r/D  (4.3.4a) 

c  =  {MgLsmBg  —  Koar{l  +  cos^e)  •+•  Koa^{l  —  cos9e)sm9e)/D  (4.3.4b) 


and 


D  =  [Fi—  Kia{l  —  cos0e)]asin^e 


(4.3.5) 


Note  that  the  denominator  D  is  nonzero  for  all  arm  angles  in  the  open  interval  (0,  tt). 

Thus  we  have  a  relationship  between  the  nominal  bicep  and  tricep  pressures  that 
depends  on  the  system  coefficients,  the  mass  M,  and  the  desired  equilibrium  angle. 
Therefore,  one  of  them  (say  P^o)  may  be  chosen  arbitrarily,  and  the  other  (Pjo)  is 
determined  by  this  choice.  These  nominal  pressures  will  be  used  in  equations  (4.2.3)  to 
determine  the  total  bicep  and  tricep  pressures.  The  significance  of  these  nominal  pressures 
is  that  if  the  control  input  Ap  =  0,  the  elbow  angle  reverts  to  the  equilibrium  angle  9e, 
thus  it  is  weU-behaved  even  if  the  control  loop  is  opened.  In  practice,  these  pressures 
could  be  easily  determined  experimentally,  e.g.  by  setting  Pjo  to  some  nominal  value,  then 
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adjusting  Pto  until  the  desired  equilibrium  arm  angle  is  obtained.  In  our  simulations 
below,  we  arbitrarily  choose  Og  =  90°. 


4.4  Sliding  Mode  Control  for  Planar  Arm  with  PMs  in  Bicep/Tricep  Positions 

Combining  (4.2.1)  with  the  above  relationships  for  F,  B,  and  K,  we  obtain  the 
following  2nd  order  equation  describing  the  system  of  Figure  4.1: 


e  =  f{e,e)  +  b{d,  (9)Ap 


(4.4.1) 


where 


(4.4.2a) 

r=l 

6 

b(e,e)=Y,bii,(0.e)  (4.4.2b) 

1=1 

with  =  Osin^O,  ^2  =  sin0(Cos0  -  1),  ^3  =  ^sin^,  ^4  =  1  +  cosO,  ^5  =  sm0,  =  1, 
/i  =  ( —  Ffto  —  Bi,iPf^)a^/I,  /2  =  (Kq  +  KiPbo)a^/I,  fz  =  {  —  Bto  —  BtiPto)ar/I, 
=  (Ko  + KiPto)ar/I,  /s  =  (aFiPjo  -  MpL)//,  /e  =  -  FiPtor//, 

61=  -BibO^fl,  b2=Kia^/I,  hz=BitarfI,  64=  -Kiar/I,  65  =  Pi  a//,  and 

h  =  Fir/L 

Because  of  our  imperfect  knowledge  of  coeflBcients  F,  K,  and  B,  we  must  assume 
that  f{9,  9)  and  h{9,  9)  in  (4.4.1)  are  imprecise.  Assume  the  extent  of  the  imprecision  on 
/  can  be  bounded  by  a  known  continuous  function  of  9  and  9.  Similarly,  we  assume  that 
the  extent  of  the  imprecision  on  b  can  be  bounded  by  a  known,  continuous  function  of  ^ 
and  9.  The  control  problem  is  to  get  the  elbow  angle  9(t)  to  track  a  desired  trajectory 
9*{t)  in  the  presence  of  model  imprecision  on  /and  b. 

Assume  we  have  an  estimate  f{9,  9)  of  /  and  let  F{9, 9)  be  a  positive  function  such 

that 


|7  -  /I  <  P  (4.4.3) 

Further  assume  the  control  gain  b{9,  9)  is  unknown  but  that  we  have  upper  and  lower 
bounds  for  it,  i.e.  we  have  quantities  6min  and  bmsx.  such  that  0  <  bmm  <b<  6max  where 
6min  and  bniax  uiay  depend  on  9  and  9.  Define 
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(4.4.4) 


T  —  ‘s/^msx./^a 


Let  the  estimation  of  6  be  6  =  \/6min&inax  • 

Let  6*{t)  be  a  smooth  function  of  time  that  represents  the  desired  angular  trajectory 
for  the  elbow  angle.  Consider  the  sliding  surfece  cr  =  0,  with 


rO  rsj 


C  -  0  "I"  fJiO 


(4.4.5) 


where  0  =  0  —  0*  is  the  tracking  error  and  /*  is  a  scalar  design  parameter.  Then  the 
sliding  mode  control  law  is  given  by 

Ap- 


where 

Ap=i-f  +  0* ) 


9>7(^  +  e)  +  (7-l)l  Ap| 

e  is  a  positive  constant,  and  T  is  the  thickness  of  a  "bovmdary  layer,"  which  is  a 
neighborhood  of  the  sliding  surface  introduced  to  reduce  control  chattering.  Thus  the 
boundary  layer  is  defined  by  jB  =  {{0,  0) :  \o'(0,  0)\  <  T}.  We  have  the  following  result 
concerning  the  tracking  accuracy  of  the  above  closed-loop  system. 

Theorem:  Consider  the  planar  arm  with  PMs  in  bicep/tricep  configuration  (Figure 
4.1).  Let  the  bicep  and  tricep  pressures  be  as  in  (4.2.3)  and  (4.4.6)  where  0*{t)  is  a 
smooth  function  of  time  such  that  0*  (t)  €  (0,  tt)  Vt .  Then  the  elbow  angle  0{t)  satisfies 

.  lim^sup  |0(t)-0*(t)|  <  ^  (4.4.9) 

to  ^oot>tQ  M 


(4.4.6) 

(4.4.7) 

(4.4.8) 
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Proof:  It  is  straightforward  to  show  that  the  surface  cr  =  0  is  a  moving  straight  line 
of  slope  —  (I'm  {6,  9)  space  passing  through  the  moving  point  {9*,  9*).  Consider  the 
trajectory  for  an  arbitrary  initial  condition  9{0)  outside  the  boundary  layer,  i.e.  such  that 
|or(0)|  >  T.  Differentiating  (4.4.5)  and  using  (4.4.6)  with  \a\  >  T,  we  have 


,  .  l-bj  +  {b-  b)(9*  -  /i?  )  -  69Sgn(a)] 

(T  —  J  ^ - ^ 


(4.4.10) 


Evaluating  the  derivative  of  V  =  in  the  region  \ct\  >  T,  we  have 
tV  _  f  ^  ,  [-bf+{b-  b)(e*  -f^)-  bgsgaio)]  \ 

v  =  ay+ - j  I 

=  o{f  —  bb  ^f)  +  a{l  —  bb  ^){ —  9*  +  fx9  )  —  bb  q\a\  (4.4.11) 


Then  we  have 


V  <  -  €\a\  (4.4.12) 


provided 


q  >  eb-^b  +  16-^6/  -f  +  -  1)(  -9*  +  t^  )\  (4.4.13) 

Using  (4.4.3),  we  have  that  (4.4.13)  is  satisfied  if 

q  >  eb-^b  +  b-^bF  +  \b-^b  -l\\f-9*+fi^  \  (4.4.14) 


Since  7  >  b~^b  ,  we  have  that  (4.4.14)  is  satisfied  if  9  is  as  in  (4.4.8).  Thus  the  state 
trajectory  approaches  the  boundary  layer  for  all  initial  conditions  outside  it. 

Now  consider  an  arbitrary  point  (^(0),  ^(0))  such  that  |cr(0(O),  ^(0))|  >  T,  and  let 
ty  be  the  time  taken  for  the  system  trajectory  to  reach  the  edge  of  the  boundary  layer  firom 
this  point.  Integrating  (4.4.12)  fi-om  t  =  0  to  tT  and  considering  initial  points  {9,  9)  on 
both  sides  of  the  boundary  layer  results  in 
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<r<|k(0)|  (4.4.15) 

Therefore,  from  any  initial  state  {9,  6)  outside  the  boundary  layer,  the  system  trajectory 
reaches  the  boundaiy  layer  in  a  finite  time,  and  (4.4.12)  guarantees  that  the  boundary  layer 
is  invariant,  i.e.  \cr\  <T,'it> 

For  t  >  txi  we  can  use  (4.4.5)  to  find  the  Laplace  transform  of  6  {t): 

9  (s)  =  -^a(s)  (4.4.16) 

which,  together  with  the  fact  that  \a\  <T,\/t>  tr,  easily  yields  (4.4.9).  □ 

Therefore,  we  have  that  the  sliding  mode  control  law  (4.4.6)  guarantees  that  the 
state  trajectory  reaches  the  boundary  layer  in  a  finite  time  whatever  the  initial  state,  and 
inside  the  boundary  layer  constrains  trajectories  to  stay  inside  it  for  all  later  time  and 
approach  a  neighborhood  of  ^  =  0  asymptotically  as  t  — >  oo  (4.4.9).  Asymptotic 
tracking  of  the  elbow  angle  to  within  a  guaranteed  accuracy  is  therefore  obtained  in  spite 
of  modeling  errors  which  may  be  present  in  PM  coefficients,  masses,  distances,  etc. 


4.5  Simulation  Results 

The  planar  arm  of  Figure  4.1  with  PMs  in  bicep/tricep  pair  configuration  is 
simulated  using  a  4th-order  Runge-Kutta  algorithm  with  a  step  size  of  0.01  seconds.  Let 
L  =  0.4572  m,  a  =  0.1016  m,  and  r  =  0.0762  m. 

We  investigate  closed-loop  tracking  performance  for  two  reference  trajectories. 
One  of  these  is  a  sum  of  sinusoids  and  the  other  is  a  square  wave-type  function  with 
smooth  transitions  between  constant  values  rather  than  discontinuous  jiraips.  We  begin 
with  a  nominal  plant  model  and  assume  the  true  system  is  within  ±50%  of  this  nominal 
model.  We  observe  the  tracking  performance  for  three  different  actual  plants  within  this 
range,  vsdth  the  three  plants  defined  by  choosing  all  B,  K,  and  F  coefficients  randomly 
within  this  range.  We  also  simulate  ten:q)erature  variation  effects  by  letting  the  coefficients 
B  and  K  decrease  slowly  over  the  time  of  the  simulation.  This  simulates  the  effect  of  the 
PMs  temperature  increasing  with  use  over  time.  For  all  simulations,  we  choose 

q  =  1.1[7(F  ±  e)  ±  (7  -  1)|  Ap]],  satisfying  (4.4.8). 
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Modeling  Errors 

Case  1 :  The  desired  trajectory  for  the  joint  is 


6*{t)  =  ^  +  0.5(sin(27r/ii)  +  sin(27r/2t)  +  sm(27r/3<))  (4.5.1) 


with  /i  =  0.02  Hz,  /a  =  0.05  Hz,  and  /g  =  0.09  Hz. 

Let  M  =  21.89  kg,  ^  =  10  and  c  =  10  and  e  chosen  by  trial  and  error  to  yield 
good  performance).  We  choose  the  boxmdaiy  layer  thickness  as  T  =  1.  From  (4.4.9),  we 
have  that  the  guaranteed  tracking  precision  is  T//!r  =  0.1  radians.  Also  let  Pfo  =  275.79 
kPa  (40  psi),  Pjo  =  371.49  kPa  (53.88  psi)  for  a  Be  of  7r/2,  satisfying  (4.3.3). 

Assume  that  the  true  values  of  f{6,  6)  and  h{6,  6)  in  (4.4.1)  are  known  to  M  within 
±  50%  of  the  best  estimates  we  have  of  them,  which  are  f{6,  6)  and  b{9,  6).  Then  we 

have  F  =  0.5|7|,  6max  =  1-5^  &mm  =  0.56,  and  7  is  determined  as  1.7321  by  (4.4.4). 

The  sliding  control  input  to  the  PM  is  given  in  (4.4.6)  with  parameters  defined  as 

above.  The  tracking  performance  for  a  typical  actual  arm  within  ±50%  of  the  model  /,  6 
is  shown  in  Figure  4.2.  Figure  4.3  shows  the  tracking  errors  for  three  different  actual 
arms  /,  6  within  this  range,  using  the  same  controller.  It  is  seen  that  for  all  systems  the 
tracking  error  is  within  predicted  boimds. 
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Time  (seconds) 

Figure  4.3  -  Tracking  errors  for  three  different  actual  plants,  M  =  21.89  kg. 


Figure  4.4  shows  a  typical  control  effort  Ap  for  the  above  controller.  It  is  evident 
that  input  pressure  varies  smoothly  without  any  obvious  chattering.  Therefore,  by  using 
the  sliding  mode  controller,  the  arm  with  PMs  in  bicep/tricep  configuration  achieves 
desired  performance  with  good  tracking  precision  and  no  obvious  chattering  for  all  three 
actual  arms. 
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Figure  4.4  -  Control  effort,  M  =  21.89  kg. 


In  practical  applications,  it  may  be  expected  that  the  mass  actuated  by  the  arm  will 
change.  To  investigate  the  robustness  of  the  sliding  controller  to  changing  masses,  we 
decreased  the  mass  M  by  a  factor  of  3  to  7.30  kg  and  used  the  same  controller  as  above. 
Figure  4.5  shows  tracking  errors  for  three  different  actual  arms  randomly  chosen  within 
the  ±50%  range.  Tracking  error  is  again  within  predicted  bounds. 
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Figure  4.5  -  Tracking  errors  for  three  different  actual  plants,  M  =  7.3  kg. 

Figure  4.6  shows  a  typical  control  effort  when  M  =  7.3  kg.  Note  that  the  control  effort 
is  smaller  than  the  M  =  21.89  kg  case,  which  is  to  be  expected  since  a  lighter  mass  is 
being  moved. 


The  mass  actuated  by  the  arm  will  be  practically  limited,  because  very  heavy  masses 
would  require  the  input  pressure  to  make  the  bicep  and/or  tricep  pressures  go  outside 
the  allowed  range  of  PM  internal  pressure  (206.84  -  896.32  kPa  in  this  simulation).  These 
limits  would  have  the  effect  of  a  saturation  nonlinearity  on  the  PM  input.  The  limitation 
on  Ap  is  not  the  sliding  controller's  shortcoming;  it  is  merely  an  acknowledgment  of  the 
practical  constraint  that  the  PM  internal  pressure  must  be  kept  within  reasonable  bounds 
to  protect  against  actuator  failure.  If  more  force  is  desired,  several  PMs  can  be  placed  in 
parallel. 

Case  2:  To  further  verify  the  sliding  mode  controller,  another  simulation  is  performed  to 
track  a  pseudo-square  wave  signal  with  a  typical  system  within  the  ±50%  range.  Here, 
the  desired  trajectory  is 


(  37r/4, 

=  <  f  ±  7rsin(27r/i  {t  -  3)), 
[7r/4, 


sin(27r/i(t  -  3))  >  J 
|sin(27r/i(t-3)|<i  (4.5.2) 

sin(27r/i(t-3))  <  -J 


with  /i  =  0.1  Hz.  This  function  transitions  between  constant  values  of  7r/4  and  37r/4 
smoothly  rather  than  with  discontinuous  jumps.  For  the  design  parameters,  we  used 
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=  10,  e  =  10,  and  T  =  0.3.  Therefore,  the  guaranteed  tracking  accuracy  is 
T//i  =  0.03  radians.  We  chose  =  637.29  kPa  and  Pjo  =  206.84  kPa  for  this 
simulation.  These  were  chosen  so  that  adequate  tracking  could  be  achieved  with  PM 
pressures  in  the  permissible  range. 

Figures  4.7  and  4.8  show  the  tracking  performance.  Figure  4.8  indicates  the 
tracking  error  exceeds  that  predicted  by  the  theorem,  i.e.  the  spikes  in  Figure  4.8  exceed 
the  predicted  maximum  of  T/^i  =  0.03.  Figure  4.9  shows  the  control  input  Ap,  which  is 
seen  to  saturate  at  approximately  206.84  kPa  and  —  689.48  kPa.  These  are  values  of  Ap 
that  cause  the  bicep  or  tricep  PM  pressure  is  go  outside  the  allowed  pressure  range  with 
the  values  of  Pjo  and  P^o  above.  The  spikes  in  Figure  4.8  occur  at  times  of  input 
saturation,  i.e.  at  these  times  the  foil  input  pressure  dictated  by  the  sliding  mode  controller 
is  not  applied  and  tracking  accuracy  is  lost.  For  this  reason,  care  should  be  taken  to  insure 
that  PM  pressure  commanded  by  the  controller  will  always  be  within  the  permissible  range 
for  the  PM.  This  requirement  on  control  effort  is  always  present  in  any  practical  control 
system.  Again,  if  more  force  is  needed,  several  PMs  should  be  used  in  parallel. 


Figure  4.7  -  Tracking  performance,  M  =  21.89  kg. 
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Figure  4.8  -  Tracking  error,  M  =  21.89  kg 


Heating  Effects 

When  the  PM  is  operated  for  extended  periods  of  time,  friction  results  in  heating  of 
the  rubber  bladder,  changing  its  characteristics.  We  assume  this  has  the  effect  of 
decreasing  the  B  and  K  coefiBcients  of  the  PM,  thus  decreasing  the  needed  control  effort 
to  perform  a  task.  This  is  due  to  the  frict  that  the  friction  and  spring  effects  opposing  the 
PM  motion  are  decreased  as  B  and  K  decrease. 

Assume  the  cold  values  of  the  PM  coefficients  are  1.5  times  the  nominal  values 
given  in  Section  2,  and  over  a  period  of  60  seconds  of  operation  these  values  decrease 
linearly  to  0.5  times  the  nominal  values.  Applying  the  sinusoidal  reference  signal  of  Case  1 
above  with  M  =  21.89  kg  and  the  same  controller  as  above,  the  elbow  angle  is  nearly 
indistinguishable  from  the  reference  trajectory,  i.e.  tracking  performance  is  excellent  and 
well  within  predicted  boimds.  This  is  to  be  expected  since  with  changing  coefficients  the 
arm  becomes  a  slowly  time-varying  system.  It  is  well-known  that  slowly  varying  systems 
pose  no  problem  for  sliding  mode  controllers  [20].  Figure  4.10  shows  the  control  input 
Ap.  Comparing  Figure  4.10  with  Figure  4.4,  the  effects  of  the  decaying  B  and  K 
coefficients  is  seen  as  the  control  effort  must  be  larger  in  the  beginning  due  to  larger  B 
and  K  coefficients,  but  decreases  toward  the  end  as  B  and  K  decrease. 


Figure  4.10  -  Control  effort  with  heat  effects,  M  =  21.89  kg. 
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5  Sliding  Mode  Control  of  Planar  Arm  with  Four  PMs 


5.1  Introduction 

The  high  power/weight  and  power/volume  ratios  of  PMs,  in  conjunction  with  their 
contractile  nature  (i.e.  their  inherent  extensibility  when  activated),  could  make  these 
actuators  extremely  useful  in  a  variety  of  rehabilitation  engineering  applications  for 
persons  with  neuromuscular  or  musculoskeletal  pathologies  that  affect  extremity  function. 
Disorders  that  limit  or  hinder  extremity  function  can  include  stroke,  traumatic  brain  injury, 
amputation,  and  spinal  cord  iigury.  These  conditions  are  associated  with  a  number  of 
fimctional  deficits,  including  weakness,  paralysis,  limb  loss,  and  joint  contracture 
associated  with  spasticity  and/or  hypertonia.  Individuals  with  such  deficits  could 
potentially  benefit  fi-om  powered  devices  that  provide  joint  loading  to  generate  or  assist 
extremity  motion,  or  maintain  extension  of  muscles  with  contractures. 

.  Many  potential  applications  involve  some  type  of  exoskeletal  or  link  segment 
configuration  that  attaches  to  existing  anatomical  body  segments  ([6],  [23]-[26]).  Devices 
of  these  types  have  often  been  relegated  to  use  within  a  specific  location  (i.e.  laboratory 
setting),  or  have  limited  actuation  to  small  segments  at  the  hand/wrist.  Lower 
power/volume  ratios  of  traditionally  used  power  sources  could  hinder  the  use  of  powered 
exoskeletal  orthoses,  for  larger  extremity  segments,  in  more  general  settings.  Utilization 
of  PMs  to  power  exoskeletal  devices,  which  interface  with  persons  who  have  functional 
deficits,  could  potentially  reduce  size  and  weight  sufficiently  to  fecilitate  more  widespread 
use  of  such  devices. 

It  should  also  be  noted  that  traditional  actuators,  such  as  electric  motors,  are 
typically  rigid  in  nature.  Such  actuators  can  create  imcomfortable  or  painful  conditions 
when  interfacing  with  humans.  For  example,  if  an  exoskeletal  brace,  actuated  by  a  stepper 
motor  pulley  arrangement,  is  activated  to  extend  a  contracted  joint  to  some  desired  angle, 
and  a  flexor  spasticity  episode  occurs,  the  stepper  motor  will  rigidly  hold  its  position  and 
not  permit  joint  flexion.  Such  a  response  could  result  in  pain  and  discomfort  among 
patients  with  joint  contractures  associated  with  spasticity.  Conversely,  activated  PMs 
maintain  inherent  extensibility,  which  could  permit  some  joint  flexion  during  muscular 
loading  due  to  episodes  of  spasticity.  While  the  elastic  properties  of  PMs  can  complicate 
the  control  aspects  of  these  actuators,  they  can  potentially  contribute  to  more  comfortable, 
devices  when  interfecing  with  human  limb  segments. 

Sliding  control  has  the  advantage  that  it  can  provide  accurate  tracking  with  bounded 
error  in  the  presence  of  model  uncertainties.  Sliding  mode  control  is  ideal  for  PM  control 
because  the  PM  model  is  usually  poorly  known,  nonlinear,  and  time-varying,  necessitating 
some  type  of  robust  control  strategy.  This  accounts  for  the  success  of  adaptive,  variable- 
structure,  and  soft  confuting  approaches  also. 

This  chapter  considers  end-effector  (hand)  control  of  a  planar  arm  actuated  by  four 
PMs.  Two  PMs  in  agonist/antagonist  configuration  actuate  the  shoulder  joint,  and  two 
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PMs  in  agonist/antagonist  configuration  actuate  the  elbow.  The  contributions  of  this 
chapter  include  modeling  of  the  arm  with  four  PM  actuators,  formulating  the  model  so 
that  it  is  suitable  for  sliding  mode  control,  determination  of  static  internal  pressures  for 
stable  arm  behavior  in  the  absence  of  a  control  signal,  and  extensive  simulation  results 
including  the  effects  of  changing  temperatures. 


5.2  Planar  Arm  Dynamic  Model 

In  this  chapter,  the  PM  is  modeled  with  the  Reynolds  model  (2.3.1),  (2.3.2).  From 
(2.3.1),  the  total  force  exerted  by  the  PM  on  the  mass  is 

<f,  =  FiP)  -  B{P)x  -  K{P)x  (5.2.1) 


The  internal  pressure  P  of  the  PM  is  an  independent  control  variable  that  can  be  externally 
commanded  by  adjusting  the  inflation  and  deflation  solenoids.  If  several  PMs  are  present, 
each  one  generally  has  its  own  F,  K,  and  B  coefficients,  its  own  internal  pressure  P,  and 
its  own  inflation  or  deflation  status. 

If  a  pair  of  such  PMs  is  tied  together  aroimd  a  pulley  of  radius  r  as  in  Figure  5.1, 
with  the  connecting  line  rigidly  attached  to  the  pulley  to  prevent  slipping,  the  torque 
imparted  to  the  pulley  by  the  PM  pair  is 

Ttotal  =n-Tt  =  (<l>b  -  (t>t)r  (5.2.2) 

where  rj  and  are  the  torques  due  to  each  individual  PM  and  are  given  by 


n  =  {Fb  -  KbXb  -  BbXb)r  (5.2.3a) 

Tt  =  {Ft  -  Ktxt  -  Btxt)r  (5.2.3b) 


In  (5.2.3),  Xb  is  the  length  of  PM  b  anda^t  is  the  length  of  PM  t. 
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Figure  5.1  -  Two  PMs  tied  together  around  a  pulley 


The  lengths  of  the  arrows  in  Figure  5.1  are  indicative  of  the  forces  exerted  by  the  two 
PMs.  Thus,  (j)b  >  <t>t  and  the  torque  exerted  on  the  joint  is  clockwise.  The  total  torque 
delivered  to  the  pulley  is  given  by 


Ttotai  =  {Fh  —  ^bXb  —  BbXb  —  Ft  +  KfXt  +  BtXt)r  (5.2.4) 


where  Fb,  Kb,  and  Bb  depend  on  the  internal  pressure  of  PM  h  and  Ft,  Kt,  and  Bt  depend 
on  the  input  pressure  of  PM  t  according  to  (2.3.2). 

The  lengths  Xb,  can  be  expressed  in  terms  of  the  pulley  angle  d  since  the  pulley 
radius  is  known.  We  will  find  it  advantageous  to  do  so  in  order  to  formulate  a  two-input 
slicing  mode  controller  for  the  planar  arm  actuated  by  opposing-pair  PMs. 


53  Dynamics  of  Planar  Arm  Actuated  by  PMs 

Consider  the  manipulator  configuration  shown  in  Figure  5.2,  which  depicts  a  two- 
joint  planar  arm.  In  this  figure,  $i  denotes  the  angle  of  joint  i,  rrii  denotes  the  mass  of  link 
i,  li  denotes  the  length  of  link  i.  Id  denotes  the  distance  fi-om  the  previous  joint  to  the 
center  of  mass  of  link  i  (center  of  mass  is  denoted  by  a  small  dot),  and  Jj  denotes  the 
moment  of  inertia  of  link  i  about  an  axis  coming  out  of  the  page,  passing  through  the 
center  of  mass  of  link  i. 
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The  dynamics  of  this  system  are  well-known  [20]  to  be  described  by: 


D{e)e  +  c{e,  0)9  +  f{9)  =  T 


(5.3.1) 


where  0  =  [0i.  ^2]^  and  r  =  [r^,  Xe]^  is  a  matrix  of  input  torques.  The  nonsingular  inertia 
matrix  D(d)  is 


D{0):= 


dll 

<^21 


(5.3.2) 


where 
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dll  =  tniZ^i  +  m2(/i  +  II2  +  2lilc2Cos92)  + 11+ 12 

(5.3.3a) 

di2  =  ^21  =  ”^2(^c2  +  /i^c2COS02) 

(5.3.3b) 

daa  =  ”12/^2  +  ^2 

(5.3.3c) 

and  li  =  i=  1,2.  The  matrix  C(0, 0)  is  given  as 


0(0,  »)  = 


h92 

-Ml 


hd2  “I"  hOi 
0 


(5.3.4) 


with  h  =  -  7712^1 /c2sin02-  The  vector  f{6)  is  given  by  f{9)  =  [fi,  f2V  where 

/i  =  +  m2li)gcos9i  +  m2^c2^cos(^i  +  ^2)  (5.3.5a) 

/a  =  m2lagcos{9i  +  ^2)  (5.3.5b) 


and  g  is  the  acceleration  of  gravity. 

The  arrangement  of  PMs  on  the  manipulator  is  shown  in  Figure  5.3.  The  base  (or 
torso)  is  fixed.  At  the  top  of  the  torso  is  the  shoulder  revolute  joint,  which  is  a  pulley. 
The  upper  arm  is  attached  to  the  shoulder  joint,  which  is  rotatable  through  an  angle 
_  I  <  <  |.  At  the  end  of  the  upper  arm  is  the  elbow  revolute  joint,  which  is  another 

pulley.  The  forearm  is  attached  to  the  elbow  joint,  which  is  firee  to  rotate  through  an  angle 
0  <  ^2  < 
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Figure  5.3  -  Arm  with  PMs 

Assume  there  are  Ug  pairs  of  matched  PMs  (i.e.  all  PMs  have  identical  coefficients 
and  lengths)  tied  together  around  the  shoulder  pulley  (radius  r^)  with  all  tricep  PMs 
recieving  the  same  input  pressure  Ptg  and  all  bicep  PMs  recieving  the  same  input  pressure 
Pig.  Similarly,  assume  there  are  Ug  pairs  of  matched  PMs  tied  together  arovmd  the  elbow 
pulley  (radius  r^)  with  all  tricep  PMs  recieving  the  same  input  pressure  P^e  and  all  bicep 
PMs  recieving  the  same  input  pressure  We  do  not  assume  the  elbow  PMs  are 
matched  to  the  shoulder  PMs,  however.  The  assumption  of  matched  PMs  is  rather 
idealistic.  However,  if  care  is  taken  to  keep  constrution  of  individual  PMs  the  same  and  of 
the  same  materials  and  dimensions,  the  assumption  of  matched  PMs  may  not  be  too 
erroneous. 

In  Figure  5.3,  the  shoulder  and  elbow  torques  Tg  and  Tg  can  be  ejqjressed  as  in 
(5.2.4): 


Tg  =  ng{Fa  -  KgXts  -  BtsXts  -Fg+  KgXbs  4-  BtshsVs  (5.3.6a) 
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Tg  =  ne(Fe  —  KgXf^  —  BbeXbe  —  Fg  +  KgXte  +  BteXte)f’e 


(5.3.6b) 


where  Fs,  K„  Bts,  and  B^s  (Fe,  K^,  Bte,  and  Bbe)  are  the  coefficients  for  the  shoulder 
(elbow)  PMs,  t  subscripts  denote  tricep  PM  quantities,  b  subscripts  denote  bicep  PM 
quantities,  s  subscripts  denote  shoulder  PM  quantities  (i.e.  PMs  located  on  the  torso 
actuating  the  shoulder),  and  e  subscripts  denote  elbow  PM  quantities  (i.e.  PMs  located  on 
the  upper  arm  actuating  the  elbow). 

A  shoulder  angle  of  =  -  Tr/2  corresponds  to  the  shoulder  tricep  PMs  being 
fully  lengthened  and  the  shoulder  bicep  PMs  being  fully  shortened,  and  9i  =  +7r/2 
corresponds  to  the  shoulder  tricep  PMs  being  fully  shortened  and  the  shoulder  bicep  PMs 
being  fulfy  lengthened.  Therefore,  the  PM  lengths  Xts  and  x^s  can  be  expressed  in  terms 
of^i  as 


Xts  =  +  f)  (5.3.7a) 

=^5(2  (5.3.7b) 

Similarly,  with  $2  =  0  corresponding  to  full  shortening  of  the  elbow  tricep  PMs  and  full 
lengthening  of  the  elbow  bicep  PMs,  and  @2  =  corresponding  to  full  lengthening  of  the 
elbow  tricep  PMs  and  full  shortening  of  the  elbow  bicep  PMs,  the  PM  lengths  Xte  and  Xbe 
can  be  ejqjressed  in  terms  of  ^2  as 

Xte  =  TeiT^  -  02)  (5.3.8a) 

Xbe  =  re02  (5.3.8b) 

Let  the  input  pressure  of  the  shoulder  bicep  and  tricep  PMs  be 


Pbs=Pobs  +  ^Ps  (5.3.9a) 

Pis  =  Pots -^Ps  (5.3.9b) 

where  Pobs  and  Pots  are  arbitraiy  positive  nominal  constant  pressures  and  Aps  is  an 
arbitrary  function  of  time  that  is  commanded  by  the  controller.  With  these  definitions,  the 
set  of  ris  shoulder  antagonist  pairs  becomes  a  single-input  system  with  input  Apg.  When 
the  bicep  input  pressure  increases,  the  tricep  input  pressure  decreases  and  vice  versa, 
varying  the  torque  on  the  shoulder  joint.  The  nominal  constant  pressures  Po6s  and  Pots 
are  arbitrary  and  can  be  chosen  so  that  (1)  the  shoulder  joint  is  well-behaved  in  the 
absence  of  a  control  signal  Aps,  and  (2)  desired  joint  stiffness  is  produced  (see  Section 
5.4).  Similarly,  let  the  input  pressures  of  the  elbow  PMs  be  defined  as 

Pbe  =  Pobe  +  APe  (5.3.10a) 
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Pte  =  Pote  -  ^Pe 


(5.3.10b) 


where  Pobe,  Pote  are  arbitrary  positive  nominal  constant  pressures.  With  these  definitions, 
the  set  of  Wg  elbow  antagonist  pairs  becomes  a  single-input  system  with  input  Apg.  Thus, 
with  the  PM  input  pressures  defined  as  in  (5.3.9)  and  (5.3.10),  the  2-DOF  planar  arm  of 
Figure  5.3  is  a  2-input  system  with  input  Apa  determining  the  shoulder  torque  and 
input  Ape  determining  the  elbow  torque  Tg. 

With  definitions  (5.3.6)-(5.3.10),  we  can  write  the  shoulder  and  elbow  torques  as 


Ts  =  Tqs  +  TiaApa  (5.3.1  la) 

Tg  =  Tog  +  TigApe  (5.3.11b) 

where 

Tqs  =  Ws  [^05  +  PlsPots  {Kos  +  KisP {Pots  +  BltsPots)^ts 

—  Fqs  ““  FlsPobs  +  (-^05  +  FisPQhs)^bs  +  {Bobs  “I"  -^165-^065) ^65] ^5  (5.3,12a) 

^Is  —  —  Fxs  +  KisXts  -  BlisXts  -  -Pis  +  ^ls^6s  +  BihsXbs]'^' s  (5,3.12b) 

Toe  =  [Fqc  +  Fi^Pobe  (Foe  +  FiepQi^)xi^  —  (^obe  +  ‘Sl6ePobe)^be 
—  F^e.  Fx^P^te  +  (-^Oe  FxePoi^^te  (-®0<€  “I"  BxtePot^^t^'^e  (5.3. 12c) 

Txe  =  [^le  +  Fxe^be  ~  Bxhe^be  +  ^le  Fxe^te  +  BxteXte\Te  (5.3. 12d) 


In  (5.3,12)j  Pxts^  -^obs?  Bibs,  Bobe,  Bibe,  Bote  and  Bue  are  the  appropriate 

coeflScients  fi-om  (2.3.2c),  depending  on  whether  the  PMs  are  being  inflated  or  deflated. 

Combining  (5.3.1)  -  (5.3.12),  we  can  arrive  at  the  following  model  for  the  planar 
arm  actuated  by  four  PM  groups  as  in  Figure  5.3: 


where 


■ 

02. 


tti 

a2 


+  G 


Apa 

Ape 


(5.3.13) 


<5.3.14) 


and 
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Tu 

0 


0 

ne 


(5.3.15) 


G  =  D- 


Note  that  aj ,  a2 ,  and  G  are  functions  ofOi,  61,62,  and  62. 

The  planar  arm  actuated  by  +  Ue  pairs  of  PMs  in  opposing  pair  configuration 
modeled  as  in  (5.3.13)  is  now  in  a  form  which  can  be  handled  by  multi-input  sliding  mode 
control  methods. 


5.4  PM  Nominal  Pressures  for  Desired  Equilibrium  Position  of  Planar  Arm 

In  this  section  we  find  nominal  pressures  Pots>  Pob$y  Pote^  P )6e  such  that  the  arm 
lias  an  equilibrium  point  at  a  desired  constant  shoulder  angle  digq  and  an  elbow  angle  02eg* 
We  do  this  in  order  that,  if  control  is  lost,  the  arm  will  revert  to  the  desired  equilibrium 
position.  Another  reason  to  do  this  is  to  produce  desired  stifiBness  in  the  joint.  In  order  to 
find  proper  nominal  pressures,  we  find  relationships  between  them  to  balance  the  steady- 
state  clockwise  and  counterclockwise  torques  about  the  shoulder  and  elbow  as  functions 
of  the  desired  equilibrium  joint  angles.  For  this  analysis,  we  assume  all  PM  coeflBcients 
are  exactly  known.  If  they  are  not,  see  the  end  of  this  section. 

From  (5.3.6)-(5.3.8)  and  (5.3.12),  the  total  steady-state  clockwise  and 
counterclockwise  torques  about  the  shoulder  in  Figure  5.3  are 

Tacw  =  »^s[-Pos  +  PlsPobs  ~  i^Qs  +  KuPobsysi^  ~  ^leq)Vs  +  '^\2  (5.4.1a) 

Tsccw  —  Ws[-iPos  +  PlsP^ts  ~  {Kqs  +  KisPqis)''’ s{^leq  +  f )]**«  (5.4.1b) 

where 

Ti2  =  (m^/cl  +  *^2^l)^COS01e5  +  rn.2ic2^COS(01eq  +  d2e,)  (5.4.2) 

is  the  clockwise  torque  imparted  to  the  shoulder  by  gravity,  and  r*  is  the  radius  of  the 

shoulder  pulley.  .  > 

Equating  the  clockwise  and  counterclockwise  torques  results  in  the  following 

relationship  between  Po6s  and  Pot*: 


-Pots  =  ‘rrisPobs  +  (5.4.3) 


where 


49 


(5.4.4a) 


ms  -  [Fis  -  Kursi^  -  Oieq)]/As 

Cs  =  {2Kosrseieg  +  Ti2/nsrs)//^s  <5.4.4b) 

and 

A.  =  Fu-KurA9le,  +  l)  ,  (5-4.5) 

Similarly,  the  total  steady-state  clockwise  and  counterclockwise  torques  about  the 
elbow  are 

Tecw  =  ne[FiePote  “  (i^Oe  +  KuPoteY ^{1:  —  d2eq)Ve  +  '^2  (5.4.6a) 

Teccw  =  ’^e[^le^06e  “  (-^Q)e  +  -ft^le-fb6e)’'e^2e9]’'e  (5.4.6b) 

where 

T2  =  rn2^c2^C®s(0ieg  "b  ^2eq)  (5.4.7) 

is  the  clockwise  torque  imparted  to  the  elbow  by  gravity,  and  Vg  is  the  radius  of  the  elbow 
pulley. 

Equating  these  torques  results  in  the  following  relationship  between  Po6e  and  Pate' 

Pobe  ~  mgPQtg  -l-  Cg  (5.4.8) 

where 

mg  =  [Fu-Kure{'rr-e2eq)]/^e  (5.4.9a) 

Cg  =  [Kogrg{2e2gq  -  tt)  -I-  rz/werJ/Ae  (5.4.9b) 

and 
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—  Pie  P-le'^e^2eq 


(5.4.10) 


Thus  we  have  relationships  between  the  nominal  bicep  and  tricep  pressures  for 
shoulder  and  elbow  joints  that  depend  on  the  system  coefficients  and  the  desired 
equilibrium  angles.  Therefore,  for  a  given  shoulder  equilibrium  angle  9icq,  one  of  the 
nominal  shoulder  pressures  (say  Po6«)  could  be  chosen  arbitrarily  in  (5.4.3),  and  the  other 
(Pots)  determined  by  this  choice.  Similarly,  for  a  given  elbow  equilibrium  angle  62eq,  Pote 
could  be  chosen  arbitrarily  in  (5.4.8),  and  Pote  determined  by  this  choice.  These  nominal 
pressures  will  be  used  in  equations  (5.3.9)  and  (5.3.10)  to  determine  the  total  PM 
pressures.  The  significance  of  these  nominal  pressures  is  that  if  the  control  inputs 
Aps  =  Ape  =  0,  the  arm  orientation  reverts  to  the  equilibrium  angles  02eg-  Thus  the 
arm  is  well-behaved  even  if  the  control  loop  is  opened. 

In  practice,  if  the  exact  PM  coefficients  are  unknown,  these  pressures  could  be 
easily  determined  ejqjerimentally,  e.g.  by  first  setting,  say  Po^s  and  Po^  to  some  nominal 
values,  then  manually  adjusting  Po^s  and  Po^e  until  the  desired  equilibrium  arm  orientation 
is  obtained.  In  our  simulations  below,  we  arbitrarily  choose  9ieq  =  —  7r/4,  62eq  =  7r/2- 

It  should  be  noted  that  larger  nominal  pressures  increase  joint  stiffiiess  and 
decrease  joint  compliance.  An  advantage  of  decreasing  joint  compliance  is  reduced 
susceptibility  of  the  arm  to  impact  disturbances  such  as  unintended  hitting  of  the  arm  or,  m 
case  of  exoskeletons,  unintended  limb  motions,  which  might  decrease  tracking  accxiracy. 
A  disadvantage  of  decreased  compliance  is  less  flexibility  in  the  arm,  increasing  the 
chances  of  discomfort  and  possible  damage  when  the  arm  comes  in  contact  with  hmnans 
or  equipment,  or  in  the  case  of  exoskeletons,  greater  discomfort  in  the  event  of  limb 
spasms. 

Nominal  pressures  are  also  important  when  considering  the  total  PM  pressures  that 
will  be  necessary  to  accomplish  a  given  task.  PM  pressures  are  practically  limited  to  lie 
between  maximum  and  minimum  values,  depending  on  the  PM.  However,  a  given  task  of 
the  arm  can  be  accomplished  with  any  nominal  pressure,  if  joint  stiffiiess  is  not  a  concern. 
To  keep  all  PM  pressures  within  the  allowable  range,  it  is  generally  necessary  to  adjust  the 
nominal  PM  input  pressures  as  well  as  the  number  of  parallel  PMs  actuating  a  joint.  The 
necessary  pressures  wiU  depend  on  the  tracking  task  and  the  mass  to  be  actuated  by  the 
arm.  For  instance,  in  the  simulations  at  the  end  of  this  chapter,  the  link  masses  are  both  1 0 
kg.  In  order  to  accompish  the  tracking  tasks  while  keeping  all  PM  pressures  within  the 
allowable  range  for  the  PMs  under  consideration,  i.e.  206.844  <  P  <  620.532  kPa 
(30  <  P  <  90  psi),  it  is  necessary  to  use  6  pairs  of  PMs  actuating  the  shoulder  and  3  pairs 
for  the  elbow,  with  Pots  =  Pote  =  310.3  kPa  (45  psi).  Pots  =  449.6  kPa  (65.2  psi),  and 
Po6e  =  310.5  kPa  (45  psi),  satisfying  (5.4.3)  and  (5.4.8). 
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5.5  Two-input  Sliding  Mode  Control  for  Planar  Arm  Actuated  by  Four  Groups  of 
PMs 

Consider  the  model  (5.3.13)-(5.3.15)  of  the  planar  arm  actuated  by  four  PMs. 
Because  of  our  imperfect  knowledge  of  coefficients  F,  K,  and  B  for  all  PMs,  we  must 
assume  that  ai{6,  6),  02(6,  $),  and  G{6,  6)  are  imprecise.  Let  the  extent  of  the 
imprecisions  on  ai,  02,  and  G  be  bounded  by  known  continuous  functions  of  di,  $1,  $2, 
and  02*  Th®  control  problem  is  to  determine  torques  Tg  and  Tg  to  force  the  end  effector 
(i.e.  the  end  of  the  forearm)  to  follow  a  desired  path  in  the  spatial  variables  x  and  y  (see 
Figure  5.2)  in  the  presence  of  model  imprecision  on  ai,  02,  and  G.  It  is  straightforward  to 
convert  this  into  a  tracking  problem  for  the  shoulder  and  elbow  joint  angles  di(t)  and 
$2{t)  using  the  inverse  kinematics  of  the  arm. 

Assume  we  have  estimates  Ui,  02  of  Ui  and  02  such  that 

\ai  -  ai\  <  Ai  (5.5.1) 


for  some  known  positive  functions  Ai,  i  =  1, 2.  Further  assume  the  control  gain  matrix  G 
is  unknown  but  that  we  have  an  estimate  G  for  it  such  that 


G=(I  +  A)G 


(5.5.2) 


with  I  Ayj  <  6ij  for  i,  j  =1,2  where  6ij  are  known  positive  functions.  Also  we  assume 
that  G  is  nonsingular  over  the  entire  state  space,  and  that  G  is  invertible,  continuously 
dependent  on  the  parametric  uncertainty,  and  such  that  G  =  G  in  the  absence  of 
parametric  uncertainty. 

Let  9i(t)  and  be  smooth  functions  of  time  that  represent  the  desired 

trajectories  for  the  shoulder  and  elbow  angles.  Consider  the  two  sliding  surfoces  a,-  =  0, 
i  =  1,  2  with 


Oi  —  Oi  -h 


=  Oi  —  6r 


(5.5.3) 


where  di  =  di  —  6i  are  the  two  tracking  errors,  fii,  ^2  positive  scalar  design 
parameters,  and 


Oyi  — 


rsj 

% 


(5.5.4) 
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Then  the  tracking  problem  can  be  translated  into  finding  inputs  [Ap^  Ape]^  that  verify  the 
individual  sliding  conditions 


1^2^  I 


with  >  0  in  the  presence  of  parametric  imcertainty. 
Let  the  sliding  mode  control  law  be  given 


A;isgn((7i) 

A:2Sgn(a2) 


(5.5.5) 


(5.5.6) 


where  k\  and  ^2  are  positive  constants.  Then  since 

d\  =  dl  —  ai  .+  Aii((9ri  —  Oi)  +  Ai2(^rl  ~  ®2) 

-  Ai2A:2Sgn(a2)  -  (1  +  Aii)/!;isgn(ai)  (5.5.7a) 

<T2  =  U2  ~  <*2  +  A2i(^r2  ~  ®2)  +  ^22(^2  ~  ®2) 

-  A2iA:iSgn((ri)  -  (1  +  A22)A:2Sgn((T2)  (5.5.7b) 

the  sliding  conditions  (5.5.5)  are  verified  if 

(1  —  ^ii)/?!  >  .Ai  +  ~  ®i|  +  ^I2|^rl  “  ®2|  “  ^12^2  (5.5.8a) 

(1  —  622)^2  ^  -^2  +  ^2l|*^r2  “  ®ll  "b  ^22l^r2  ~  ®2|  ~  ^21^1  (5.5.8b) 


and,  in  particular,  if  fci  and  k2  are  chosen  such  that 

(1  —  5ii)/i:i  +  ^12^2  =  Ai  +  “  ^ll  +  ^12|<^rl  “  0'2|  +  Vi  (5.5.9a) 

(1  —  ^22)^2  +  ^21^1  =  A2  +  ^2\\^r2  ~  ®l|  +  ^22|^r2  ~  ^2!  +  %  (5.5.9b) 


The  Frobenius-Perron  Theorem  [28]  guarantees  that  equations  (5.5.9a)  and 
(5.5.9b)  have  a  unique  nonnegative  solution  [fci,  ^2].  Therefore,  the  control  law  (5.5.6) 
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with  ki,  k2  defined  by  (5.5.9a)  and  (5.5.9b)  satisfies  the  sliding  conditions  (5.5.5)  in  the 
presence  of  parametric  uncertainties  bounded  as  in  (5.5.1),  (5.5.2). 

The  control  law  (5.5.6)  is  known  to  cause  chattering  due  to  the  discontinuities 
across  the  sliding  surfaces.  Chattering  can  cause  excessive  wear  on  the  PM  valves,  and 
therefore  is  undesirable.  To  reduce  this,  we  introduce  boundary  layers  in  the  vicinity  of 
the  sliding  surfaces.  Inside  the  boundary  layers,  the  control  laws  are  linear  and 
continuous.  Outside  the  boundary  layers,  the  control  laws  have  the  form  given  in  (5.5.6). 
With  bovmdaiy  layers,  the  complete  multi-input  sliding  mode  control  law  for  the  arm  is 
given  by 


K\ 

A2. 


Cl 


A:isat(<Ti/ri)j\ 

fc2sat(<r2/r2)J  J 


(5.5.10) 


where 


\y\  <  1 

otherwise 


(5.5.11) 


and  Fi  and  r2  are  the  boundary  layer  thicknesses,  i.e.  the  boundary  layers  are  defined  by 

Li  =  {(^1,  (9i)  :  Wx(du  (9i)|  <  Fi}  (5.5.12a) 

L2  =  m,  02)  :  ^2(02,  e2)\  <  Ta}  (5.5.12b) 


Thus,  when  the  state  trajectories  are  outside  their  respective  boimdary  layers,  the 
trajectories  approach  and  reach  the  boundary  layers  in  finite  times  due  to  (5.5.7a)  aiid 
(5.5.7b)  when  ki,  k2  are  chosen  to  satisfy  (5.5.9a)  and  (5.5.9b).  Once  inside  the  boundary 
layers,  the  state  trajectories  9i  and  62  are  governed  by  (5.5.3),  i.e.  taking  the  Laplace 
transform  of  0,-  {t)  we  have 


1 

s  +  /^i 


(5.5.13) 


which,  together  with  the  feet  that  |o-i|  <  F,-  Vt  >  ti  with  ti  finite,  yields 
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(5.5.14) 


lim  sup|0i(i)  -  ^ 

<0  -*  00  t  >  to 


for  i  =  1,2. 

Therefore,  we  have  that  the  sliding  mode  control  law  (5.5.10)  guarantees  that  the 
state  trajectories  reach  their  respective  boundary  layers  in  finite  times  whatever  the  initial 
states,  and  inside  the  boundary  layers  constrains  trajectories  to  stay  inside  them  for  all  later 
time  and  approach  neighborhoods  of  =0  asymptotically  as  t  — >  oo.  Asymptotic 
tracking  of  the  shoulder  and  elbow  angles  to  within  guaranteed  accuracy  is  therefore 
obtained  in  spite  of  modeling  errors  which  may  be  present  in  PM  coefBcients,  masses, 
distances,  etc.  Thus  the  desired  spatial  path  is  followed  by  the  end  effector  within  an  error 
bound. 


5.6  Simulation  Results 

The  planar  arm  of  Figure  5.3  with  opposmg-pair  PMs  of  the  type  described  in 
Section  5.2  actuating  the  shoulder  and  elbow  joints  is  simulated  using  a  4th-order  Runge- 
Kutta  algorithm  with  a  step  size  of  0.01  seconds.  Let  Zj  =  ^2  =  0.46  m.  Id  =  lc2  =  0-23 
m,  mi  =  m2  =  10  kg,  =  7.62  cm,  and  Vg  =  5.08  cm,  Ug  =6  and  rig  =  3.  For  these 
simulations  we  assume  aU  physical  quantities  of  the  arm,  i.e.  masses,  lengths,  etc.  are 
exactly  known,  but  that  the  PM  coefBcients,  i.e.  F,  K,  and  B  are  not  known  with 
precision.  Assume  all  12  shoulder  PMs  (6  pairs)  are  matched  to  eaeh  other,  but  not  to  the 
elbow  PMs.  Similarly  assume  all  6  elbow  PMs  (3  pairs)  are  matched  to  each  other,  but 
not  to  the  shoulder  PMs. 

The  sliding  mode  controller  is  designed  according  to  (5.5.10)  with  ai  and  02  given 
by  (5.3.14)  and  G  given  by  (5.3.15)  using  ideal  values  for  all  F,  K,  and  B  coefficients. 
To  investigate  robustness  of  the  sliding  mode  controller,  we  randomly  choose  three  sets  of 
actual  (nonideal)  F,  K,  and  B  coefficients  from  a  uniform  distribution  within  ±50%  of 
their  nominal  values.  The  coefficients  used  are  listed  in  Table  5.1. 

The  coefficients  listed  in  Table  5.1  are  those  that  produced  the  greatest  tracking 
error  (while  maintaining  a  random  selection),  so  that  we  could  see  how  close  to  the 
predicted  error  bounds  (5.5.14)  the  actual  tracking  errors  were.  Other  choices  for 
coefficients  produced  less  tracking  error,  so  are  not  reported.  In  the  process  of  choosing 
which  sets  of  coefficients  to  use  for  the  simulations,  it  was  noticed  that  traeking  error  was 
by  far  the  most  sensitive  to  variations  in  the  coefficient,  Fi.  This  is  perhaps  not  surprising, 
since  from  (5.2.1),  (5.2.4)  the  PM  contractile  force  F  is  directly  proportional  this 
coefficient,  and  F  has  a  more  direct  effect  on  the  PM  force  than  the  other  coefficients 
(Ko,Ki,Bo,andBi). 
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Table  5.1  -  PM  coefficient  sets  used  for  the  simialations.  Si(Ei)  =  shoulder  (elbow)  PM 

coefficients  for  actual  PM  coefficient  set  i.  _ _ 


Coefficient 

Ideal 

SI 

El 

S2 

E2 

S3 

E3 

1.79 

1.53 

2.58 

0.984 

2.34 

1.69 

2.40 

■Ml 

0.763 

1.67 

WBM 

0.812 

0.722 

Ko 

7.17 

7.70 

IIQgB 

8.25 

6.75 

3.07 

4.28 

IQSi 

4.49 

4.35 

2.82 

2.70 

Boi 

1.01 

0.794 

0.965 

1.26 

1.11 

1.36 

1.35 

Bu(  X  10-3) 

6.91 

5.19 

4.02 

8.57 

5.21 

7.08 

6.93 

Bod(  X  10-0 

6.00 

8.60 

8.11 

5.59 

8.24 

4.22 

7.26 

Bid(  X  10-4) 

-8.03 

-5.07 

-8.53 

-9.11 

-  8.14 

-  9.41 

-7.46 

For  all  simulations,  we  use  /i*  =  5.0  and  Ti  =  1.0.  From  simulations  with 
parametric  uncertainties  within  the  ±50%  range,  we  find  that  Ai  =  12.5  and  A2  =  15.0 
satisfy  inequalities  (5.5.1).  From  (5.5.2)  we  also  have  A  =  [Ay]  where 


I  Ay  I  <  6ij  — 


/0.5, 

10^ 


i  =  j 
i^j 


(5.6.1) 


Using  these  values  and  the  simulated  functions  \0ri  —  «j|,  *  =  1»  2,  we  find  fc,-  =  50,  i  =  1, 
2,  satisfies  (5.5.8),  resulting  in  closed-loop  stability  and  convergence  of  the  trajectories  to 
the  interior  of  the  boundary  layers  and  a  guaranteed  tracking  precision  of  F*/ fii  =  0.2 
radians  for  both  joints.  Also  let  Oieq  =  —  7r/4,  d2eq  —  Pobs  —  Pote  —  310.3  kPa  (45 
psi),  resulting  in  Pots  =  490.4  kPa  (71.1  psi)  and  Po6e  =  310.5  kPa  (45  psi),  satisfying 
(5.4.3)  and  (5.4.8).  With  these  nominal  pressures  and  with  n^,  rig  as  above,  all  PM 
pressures  remain  in  the  allowable  range  206.844  ^  T*  ^  620.532  kPa  (30  <  P  <  90  psi) 
for  all  control  tasks  in  this  section. 

We  investigate  controller  performance  for  three  tracking  tasks  for  the  end  effector 
in  x-y  space:  a  sinusoidal  spline,  a  vertical  line,  and  a  circle. 

Sinusoidal  spline 

The  desired  spatial  path  is  given  by: 

Xdit)  =  0.1524  +  0.1219t  m  (5.6.2a) 

y^{t)  =  0.39624  +  0.24384sin(0.47rt  -  7r/2)  m  (5.6.2b) 


where  0  <  t  <  5  seconds. 
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The  spatial  tracking  performance  for  the  true  plant  with  PM  coefficients  in  set  1  is 
shown  in  Figure  5.4.  Tracking  performance  when  PMs  are  described  by  coefficient  sets  2 
and  3  are  similar  to  Figure  5.4.  It  will  be  noted  that  there  is  some  spatial  tracking  error, 
which  is  to  be  expected  due  to  the  parameter  errors.  An  initial  transient  can  also  be  seen, 
due  to  the  initial  pressure  adjustment  that  is  necessary  to  produce  accurate  tracking. 


Figure  5.4  -  Spatial  tracking  behavior,  PMs  in  coefficient  set  1,  mi  =  m2  =  10  kg. 


Figure  5.5  shows  the  control  efforts  Aps,  Ape  that  produced  the  tracking 
performance  in  Figure  5.4.  It  will  be  noted  that  with  the  nominal  pressures  Pots,  Fobs, 
Pote,  and  Pobe  ^ven  above,  the  PM  input  pressures  remain  within  the  allowable  range  for 
these  PMs.  The  elbow  angle  tracking  error  (which  is  larger  than  the  shoulder  angle 
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tracking  error  in  this  case)  for  coefficient  sets  1, 2,  and  3  is  shown  in  Figure  5.6.  It  is  seen 
that  for  all  systems  the  tracking  error  is  within  predicted  bounds,  i.e. 

r- 

lim  sup|0j(t)  —  0^(t)|  <  —  =0.2  radians  (5.6.3) 

to  ->  00 

This  tracking  error  is  obtained  after  the  initial  transients  have  died  away. 


Time  (seconds) 


Figure  5.5  -  Control  effort  producing  tracking  performance  of  Figure  5.4. 
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Figure  5.6  -  Elbow  angle  tracking  errors  for  3  different  plants  (PM  coeflScient  sets  1, 2, 
and  3),  sinusoidal  spline  reference  trajectory,  mi  =  m2  =  10  kg. 


Vertical  line 

The  desired  spatial  path  for  the  vertical  line  is  given  by: 

Xd{t)  =  0.6096  m  (5.6.4a) 

=  0.39624  +  0.24384sin(0.47rt  —  7r/2)  m  (5.6.4b) 

The  spatial  tracking  performance  for  PMs  with  coefficients  in  set  3  is  shown  in  Figure  5.7. 
Tracldng  performance  for  other  sets  is  similar.  The  corresponding  control  effort  is  shown 
in  Figure  5.8.  The  elbow  angle  tracking  error  (which  is  larger  than  the  shoulder  angle 
tracking  error  in  this  case)  for  PMs  with  coefficients  in  sets  1,  2,  and  3  is  shown  in  Figure 
5.9.  It  is  seen  that  for  all  systems  the  tracking  error  is  within  predicted  bounds  and  PM 
pressures  remain  within  the  allowable  range. 
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y(m) 


0.7 


Figure  5.7  -  Spatial  tracking  behavior,  PMs  in  coefficient  set  3,  mi  =  m2  =  10  kg 


•6( 


Figure  5.9  -  Elbow  angle  tracking  errors  for  3  different  plants  (PM  coefficient  sets  1,  2, 
and  3),  vertical  line  reference  trajectory,  mi  =  m2  =  10  kg. 


Circle 

The  desired  spatial  path  for  the  circle  is  given  by: 

Xd if)  =  0.36576  +  0.3048sin(0.47rt  -  0.7754))  m  (5.6.5a) 

yd{t)  =  0.36576  +  0.3048cos(0.47it  +  2.3462)  m  (5.6.5b) 

The  spatial  tracking  performance  for  PMs  with  coefficients  in  set  1  is  shown  in  Figure 
5.10.  The  corresponding  control  effort  is  shown  in  Figure  5.11.  The  elbow  angle  tracking 
error  (which  is  larger  than  the  shoulder  angle  tracldng  error  in  this  case)  for  PMs  with 
coefficients  in  sets  1,  2,  and  3  is  shown  in  Figure  5.12.  It  is  seen  that  for  all  systems  the 
tracking  error  is  within  predicted  bounds  and  PM  pressures  remain  within  the  allowable 
range. 
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Figure  5.11-  Control  effort  producing  tracking  performance  of  Figure  5.10. 
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Figure  5.12  -  Elbow  angle  tracking  errors  for  3  different  plants  (PM  coefficient  sets  1,  2, 
and  3),  circle  reference  trajectory,  mi  =  m2  =  10  kg. 


Sinusoidal  spline,  doubled  mass 

In  practical  applications,  it  may  be  expected  that  the  mass  actuated  by  the  arm  will 
change.  To  investigate  the  robustness  of  the  sliding  controller  to  changing  masses,  we 
increased  the  arm  masses  mi,  m2  each  by  a  factor  of  2  to  20  kg  and  used  the  same 
controller  as  above  to  track  the  sinusoidal  spline  reference  trajectory  (5.6.2).  Both  the 
shoulder  and  elbow  angle  tracking  errors  are  again  within  predicted  bounds,  indicating 
that  the  sliding  mode  controller  is  robust  to  changes  in  mass. 

Figure  5.13  shows  the  control  effort  produced  by  the  sliding  controller  for  this 
plant.  It  is  seen  that  ^th  the  above  nominal  pressures,  the  PM  pressures  remain  within 
the  allowable  range  for  the  duration  of  the  control  process. 
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Figure  5.13  -  Control  effort,  sinusoidal  spline  reference  trajectory,  coefficients  in  set  1, 

mi  =  m2  =  20  kg. 


5.7  Discussion 

A  two-input  sliding  mode  controller  has  been  designed  for  a  2  DOF  planar  arm 
assembly  with  Wghly  nonlinear  pneumatic  muscle  actuators  in  opposing  pair  configurations 
actuating  the  shoulder  and  elbow  joints.  Designation  of  these  joints  as  shoulder  and  elbow 
is  arbitrary  and  nominal;  however,  it  is  convenient  for  considering  applications  in  which 
pneumatic  muscle  actuated  devices  could  provide  joint  loading  to  generate  or  assist 
extremity  motion,  or  maintain  extension  of  muscles  with  contractures. 

The  control  input  for  the  planar  assembly  enters  the  process  through  nonlinear 
spring  and  fiiction  coefficients  and  a  nonlinear  contractile  force  term  that  are  contained 
within  a  mathematical  model  for  the  pneumatic  muscle  actuators.  A  dynamic  model  for 
the  arm  with  four  PM  actuators  is  derived,  and  this  is  put  in  a  form  suitable  for  sliding 
mode  control.  A  relationship  between  static  internal  pressures  is  derived  to  give  stable 
arm  behavior  in  the  absence  of  any  control  signal. 

Simulations  of  closed-loop  tracking  were  performed  with  sinusoidal  spline,  vertical 
line,  and  circle  paths  desired  for  the  assembly  end  effector.  These  paths  are  generic  in 
nature;  however,  they  can  serve  well  as  potential  building  blocks  for  an  ample  variety  of 
more  task-oriented  end  effector  paths.  Data  from  the  motion  biomechanics  lita-ature  ([29], 
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[30]),  as  well  as  dedicated  biomechanical  experimentation,  could  be  used  to  identify  actual 
end  effector  paths  for  practical  activities  such  as  feeding  and  grooming. 

Closed-loop  tracking  performance,  resulting  from  simulations,  is  ki  line  with 
theoretically  predicted  behavior.  Closed-loop  tracking  with  several  arm  models  with 
coefficients  within  a  ±50%  range  are  shown  to  agree  with  theoretical  results.  The 
controller  is  shown  to  be  robust  for  a  100%  change  in  arm  masses.  The  effects  of  heating 
are  also  presented.  It  is  seen  that  some  reasonable  amovmt  of  heating  may  be  tolerated. 
Thus,  sliding  mode  control  is  shown  to  be  a  very  promising  method  for  control  of  systems 
containing  pneumatic  muscle  actuators,  including  devices  that  could  potentially  benefit 
persons  with  neuromuscular  or  musculoskeletal  pathologies. 
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6  Neuro-fuzzy  Modeling  of  PM 


6.1  Introduction 

Recently,  neural  networks,  fu22:y  systems  and  combinations  of  these  have  obtained 
great  success  in  modeling  nonlinear  dynamics.  Of  them,  recurrent  neuro-fuzzy  systems 
are  most  effective  and  efficient  ([31]-[36]).  They  combine  the  dynamic  mapping 
capability  of  common  recurrent  neural  networks  with  the  human-like  decision-making 
capability  of  fuzzy  systems.  Some  theoretical  results  ([38],  [39])  also  prove  their 
potential  for  approximating  a  large  class  of  dynamics  to  an  arbitrary  degree  of  accuracy. 
Therefore,  we  implement  this  technique  to  develop  a  more  accurate  model  for  the 
pneumatic  muscle  by  using  soft-computing  techniques,  i.e.  fuzzy  logic  combined  with 
neural  networks,  based  on  recorded  data. 


6.2  Recurrent  Neuro-Fuzzy  Modeling  for  Pneumatic  Muscle 

According  to  studies  of  Reynolds  et  al.  [17],  the  dynamics  of  a  pneumatic  muscle 
system  can  be  described  by  a  second-order  nonlinear  differential  equation.  Therefore,  in 
order  to  model  the  PM  with  a  neuro-fiizzy  system,  we  choose  a  model  of  the  form: 

Kk)  =  f{y(k-'(),y{k-2),x{jc-\\xijc-l^+e{k)  (6.2.1) 

where  k  is  the  time  step,  y  is  the  estimated  length  of  the  muscle  and  x  is  the  voltage  input 
to  the  valve.  The  above  equation  can  be  approximated  by  a  recurrent  neuro-fuzzy 
inference  system  as  shown  in  Fig.  6.1. 

In  the  system  of  Figure  6.1,  there  are  81  Sugeno-type  rules  with  the  following 
expression  used  in  the  rule  base: 


RULEi  I  =  1,2, -,81: 

r  ^ 


IFSik-\)is- 

Mn 

Fn 

>ANDy(k-2)iS’ 

M21 

Mil 

>ANDxik-\)is- 

FZ2 

>ANDx{k-2)is< 

Fax 

F-A2  * 

THEN  y  =  Ci 

./“43. 

(6.2.2) 


where  {j  =  1,  2, 3,  4;  w  =  1, 2, 3)  are  input  membership  functions  and  C, 

(/  =  1,...81)  represents  81  local  models  in  fuzzy  partitions.  In  (6.2.2),  AND  means  the 
fuzzy  “AND”  operator,  or  T-norm.  In  this  chapter,  the  product  T-norm  is  adopted  for 
“AND.” 
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Layer  F 

L^er  E 

Layer  D 

Layer  C 


Layer  B 
L^er  A 


Fig.  6.1  -  Recurrent  neuro-fuzzy  inference  system 


6.3  Topologies 

The  recurrent  neuro-fuzzy  inference  system  is  six-layered.  Layer  A  is  an  input 
layer,  and  neurons  represent  fuzzy  variables  found  in  the  fuzzy  inference  machine.  They 
are  volts  applied  to  the  air-inlet  valve  and  length  of  muscle  estimate  in  the  past  two  time 
steps.  Layer  B  is  a  term  layer.  Nodes  in  this  layer  are  called  term  nodes  and  correspond 
to  linguistic  values  (small,  medium  and  large)  of  the  input  variables.  The  membership 
functions  ^  are  described  below.  Layer  C  is  a  rule  layer.  Nodes  with  label  FI  represent 

fuzzy  reasoning  rules  collected  in  the  rule  base.  Note  that  label  11  means  that  the  product 
“AM)”  operator  is  employed  to  calculate  the  degree  of  match  for  parts  of  the  premise, 
which  evaluates  the  degree  of  activation  of  rules.  Layer  D  is  a  defuzzification  layer.  Its 
first  nemon  calculates  the  reciprocal  of  siun  of  degrees  of  match  of  rules,  and  connection 
weights  on  links  between  other  neurons.  The  other  neurons  in  layer  D  define  local 
models  C,  appearing  in  the  consequents  of  rules.  The  neuron  in  layer  E  is  an  aggregation 
neuron,  which  aggregates  the  weighted  local  models.  Layer  F  is  the  output  layer.  The 
output  of  the  neuron  in  this  layer  represents  the  estimated  length  of  muscle  in  the  current 
instant. 

In  all,  five  different  neurons  are  defined  to  support  the  network.  These  are 
described  below,  where  v  is  the  neuron's  input  and  z  is  its  output. 
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1.  A  unity  neuron  I  with  input-output  relationship  defined  by 

z  =  v  (6.3.1) 

2.  A  sum  neuron  E  in  layer  E  implements  a  sum  operation,  which  is  used  to 
aggregate  the  weighted  local  models; 

z  =  2]c,v,  (6.3.2) 

3.  A  product  neuron  with  label  n  is  used  to  implement  a  product  “AND”  operation 

z  =  J|w,v,.  (6.3.3) 

4.  An  inverse  neuron  with  label  ^  in  layer  D  implements  the  reciprocal  operation  in 
the  dehizzification  phase: 

^  =  /(Z  —  <6-3.4) 

5.  A  term  neuron  with  label  {j  =  1, 2, 3,4;m  =  1, 2, 3)  in  layer  B  implements  a 
membership  function  defined  below: 


Mjm(y)  = 


v-a 


jm 


^jm  ^Jm 
Cjm-V 


c  ■  —b- 


bjm 

otherwise 


(6.3.5) 


6.4  Structure  Learning  via  VISIT 

At  the  beginning,  the  RNFIS  in  Figure  6.1  is  empty  without  any  rules  or  fiizzy  sets. 
The  structure  of  the  RNFIS  refers  to  the  number  of  rules,  membership  hmctions,  and 
their  properties  (i.e.  centers  and  spreads).  A  simple  self-organizing  algorithm  known  as 
the  Variable  Input  Spread  Inference  Training  algorithm  (VISIT)  [37],  developed  by  the  PI 
and  coworkers,  is  used  to  perceive  structure  features  from  a  sequence  of  training  data. 
VISIT  is  a  variation  of  the  well-known  Modified  Learning  From  Examples  (MLFE) 
algorithm  for  identification  of  fuzzy  systems  from  data.  In  VISIT,  the  membership 
functions  can  be  any  shape  in  general,  as  long  as  they  are  convex.  In  this  chapter,  we  will 
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use  asymmetrical  triangular  input  membership  functions  and  singleton  output 
membership  functions.  The  fuz2y  system  determined  by  VISIT  is  then  used  as  an  initial 
condition  for  further  tuning  via  backpropagation.  This  is  done  to  improve  BP’s  chances 
of  converging  to  the  global  minimum  of  the  error  criterion  rather  than  a  local  minimum, 
thus  producing  a  more  accurate  model  of  the  process. 

To  begin  the  VISIT  algorithm,  training  constants  aQ,  A,  e,  and  w  are  specified  by 

the  user.  The  constant  <tq  is  the  initial  value  for  the  spread  of  the  first  membership 
function  on  each  universe.  The  constant  A  determines  when  a  new  membership  function 
is  created  on  a  universe  of  discourse.  If  a  new  training  point  is  within  a  A -cut  of  an 
existing  membership  function  on  a  universe,  a  new  membership  hmction  is  not  created  on 
that  universe.  The  constant  s  is  the  maximum  identification  error  tolerated  before  a  new 
rule  and  new  memberships  are  added  to  the  fuzzy  system.  Finally,  w  determines  the 
amount  of  overlap  between  adjacent  memberships  on  a  universe.  We  now  give  the  VISIT 
algorithm. 

VISIT  Algorithm  for  Function  Approximation 

1 .  Set  /  =  1 .  Get  the  first  training  pair  =  (xj ,  X2 . On  each  input 

universe  of  discourse  form  a  fuzzy  set  a]  characterized  by  a  membership  function 

(xj  )  with  center  at  wj-  =  x*  and  spread  ctq  .  On  the  output  universe  of 

discourse  form  a  fuzzy  set  characterized  by  a  singleton  membership  function 
with  support  y’ . 

2.  Add  the  rule 

If  x\  is  aI  and  X2  is  A2  and  ...  and  x„  is  A^  theny  is  B^ . 

3.  If  there  are  no  more  training  pairs,  the  training  cycle  is  conpleted.  Otherwise, 
increment  i  and  get  the  next  training  pair  (p*  =  (x| ,  X2 , . . . ,  x„,  y)* . 


If 


<e  where  /(tp*)  denotes  the  crisp  output  of  the  existing  fuzzy 


system  evaluated  at  the  new  training  pair  (p* ,  discard  /p*  and  go  to  3. 


5,  On  the yth  input  universe,  evaluate  all  membership  functions  at  xj- .  Call  the  fuzzy 
set  whose  membership  function  is  maximum  Aj^^^  •  If  Xj  ^  A  form  a 

new  fuzzy  set  Aj^  on  the  jfth  imiverse  characterized  by  a  membership  function 
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with  center  nij^  =  Xj  and  left  and  right  spreads  crj  and  cy  as  in  step  6  below. 

On  the  output  universe  form  a  new  ftiz2y  set  characterized  by  a  singleton 
membership  function  with  support  y' . 

6.  On  each  input  universe  on  which  a  new  membership  function  was  added  in  5,  the 
spreads  of  each  side  of  the  new  membership  function  and  its  nearest  neighbors  on 
the  left  and  right  are  re-calculated  as  follows.  The  right  ^read  of  the  new 
membership  function  and  the  left  spread  of  the  nearest  right  neighbor  are  re¬ 
calculated  as 


af  = 


w 


^new  ^nRn 

mj  -mj 


(6.4.1) 


The  left  spread  of  the  new  membership  fiuiction  and  the  right  spread  of  the  nearest 
left  neighbor  are  re-calculated  as 


w 


m 


new 


(6.4.2) 


where  denotes  the  nearest  existing  center  to  the  right  of  and 

denotes  the  nearest  existing  center  to  the  left  of 


7.  Add  the  rule 


If  xi  is  Ai  and  X2  is  A2  and  ...  and  x„  is  A„  thenyis  5”^ 


If  there  is  no  other  rule  in  the  rule  base  that  is  inconsistent  (i.e.  same  premise, 
different  consequent)  with  this  rule,  where  the  fuzzy  sets  in  the  premise  are  the 
ones  maximized  by  the  corresponding  inputs 

8.  If  there  is  another  rule  (rule  p  generated  fi'om  tp^  )  in  the  rule  base  that  is 

inconsistent  with  the  rule  formed  in  7  (rule  q  generated  fi-om  the  present  training 
pair  tp^ ),  define  fuzzy  system  /^  (  /^  )  to  be  the  fuzzy  system  with  rule  p  {q) 

included  in  the  rule  base  but  rule  q  (p)  omitted  firom  the  rule  base.  Retain  rule  p 
and  omit  rule  q  if 


(6.4.3) 
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Otherwise,  include  rule  q  and  omit  rule  p. 

9.  Go  to  3. 

Comment: 

It  is  possible  that  the  rule  to  be  added  in  step  7  has  the  same  premise  but  a  different 
consequent  than  an  existing  rule,  i.e.  the  newly  created  rule  is  inconsistent  with  an 
existing  rule.  In  such  a  case,  we  need  to  determine  which  rule  provides  a  better  match  to 
the  data  so  we  know  which  rule  to  retain  in  the  rule  base  and  which  rule  to  omit.  To  do 
thiSj  we  create  two  fhzzy  systems,  each  with  only  one  of  the  inconsistent  rules  retained, 
but  with  the  other  inconsistent  rule  omitted.  In  (6.4.3),  a  comparison  of  the  two  fuzzy 
systems  is  made  on  both  training  pairs  that  generated  the  two  inconsistent  rules.  The 
fuzzy  system  having  less  total  identification  error  for  both  training  pairs  is  retained  and 
the  other  omitted.  In  this  way,  we  determine  which  of  the  two  inconsistent  rules  does  a 
better  job  of  describing  the  data  and  retain  it  in  the  rule  base,  omitting  the  other. 


6.5  Backpropagation  Training  Algorithm 

For  fliz^  systems,  training  algorithms  provide  a  means  for  refining  inference  rules 
in  the  form  of  (6.2.2).  A  general  technique  in  most  algorithms  is  that  input  and  output 
data  recorded  from  the  actual  process  are  used  in  the  calculation  of  the  current  output 
estimate  of  the  frizzy  system.  Therefore,  these  methods  statically  refine  fuzzy  rules,  and 
therefore  are  more  suitable  to  excute  static  tasks  such  as  function  mapping  and  pattern 
classification  rather  than  modeling  of  dynamics.  In  contrast  to  these  static  training 
techniques,  the  recurrent  neuro-flizzy  inference  system  implies  a  kind  of  dynamic 
refinement  for  frizzy  inference  rules,  where  past  outputs  of  the  fuzzy  sytem  itself  and 
recorded  input  data  from  the  process  are  used  to  calculate  the  current  output  esstimate  of 
the  frizzy  system  due  to  the  existence  of  some  global  feedback  connections.  As  a  result, 
rules  are  tuned  in  dynamic  ways,  and  the  resulting  systems  can  better  approximate  the 
dynamics  of  a  wide  class  of  nonlinear  systems  with  any  degree  of  accuracy  ([38],  [39]). 

Given  N  recorded  input-output  pairs  from  the  process  {x(«),  y(«)}, «  =  1, . . . ,  iV ,  we 
can  train  recurrent  neuro-frizzy  systems  by  using  the  well-known  error  backpropagation 
(BP)  training  algorithm.  To  control  the  training  process,  the  mean  square  error  (MSB)  is 
chosen  as  the  performance  index  for  evaluation  of  the  effectiveness  of  the  frizzy  system. 
If  y{n)  is  the  nth  recorded  output  (length)  of  the  pneumatic  muscle  and  y(n)  be  the 
model  output,  the  MSB  criterion  is  written  as:  ' 


MSE  = 


where  the  training  error  e(n)  is  defined  as 

e{n)=  y{n)- y(n) 


(6.5.1) 


(6.5.2) 
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The  weights  W(f)  of  the  neural  network,  which  in  this  case  consist  of  the 
membership  function  parameters,  are  adjusted  by  the  well-known  BP  training  algorithm, 
which  is  written  as  follows: 


fV(t  +  l)  =  W(t)  +  77  +  r^W(t) 

=  W(t)-  Irjein)  +  yA  If  (/) 


(6.5.3) 


where  l^  =  [aii,6,j,c,,,-”,aj3,/^3,c,3,"-,a43,b43,C43,Ci,-",C8i]  is  a  vector  of  tuning 
parameters  in  the  recurrent  neuro-fiizzy  inference  system,  the  integer  t  denotes  the  step 
during  the  training  process,  coeflBcients  7  and  y  are  learning  parameters  that  control  the 

training  process,  and  AfF(t)  =  fF(t)-fF(t-l).  Larger  7  will  accelerate  the  training 
process  but  may  introduce  oscillations.  Larger  y  will  greatly  smooth  but  slow  the 
training.  The  parameters  C, ,  ,  bj„, ,  Cj^  are  updated  according  to  the  following  rules 


C,.  (r  + 1)  =  C,  (0  -  27e(«)  ^  +  riC,  (0 

(6.5.4) 

ajm(t  +  ^)  =  a jm (0  lT]e{n)  ^  ^  +  r^Jm (0 

(6.5.5) 

bjmit  +  ^)  =  by^{t)  2T]e{ri)^^  +YAbp„{t) 

(6.5.6) 

Cjm(t  +  '^)  =  Cj„(t)  2rje(n)^^^’  ^  +rAcj„(t) 

d/ljm  dcj„ 

(6.5.7) 

The  quantities  ^  and  ^  ^  are  calculated  by  the  following  two  formulas: 

a//™  ac, 


^(»)  ^{ri)  do\  dnet\ 

do^  dnet^  aC,- 
6  6 

^  anet*  Qo\  dnet\ 
do\  dnet\ 


1  dnetk 

=  Oa - - 

^  dCi 

=  o\  •  04^* 


(6.5.8) 
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dy{n)  _  dy(ri)  dnet\ 

djUjm  do\  dnet^^  ^f^jm 

\  do\  dnet\  \  do\  dnet\ 

—  O4  - - p  - - - - p  * - 

dnet\  dfijm  dnet\ 


_  \  dnet\  j  1  dnet\ 


(6.5.9) 


I  ^Mjm  (^^^4)  i 

where  oj,  is  the  output  of  the  rth  neuron  in  the  /rth  layer  and  netl  is  the  input  of  the  7th 
neuron  in  the  Ath  layer.  This  can  be  calculated  by 


i 


for  product  neurons 


net'  =<^ 

•'  otherwise 


(6.5.10) 


The  sensitivity  of  membership  function  to  changes  in  aj„,  is  derived  as 


-j,.'’  “P’ 

0,  otherwise 


(6.5.11) 


dUjm 

Note  that  when  a,yn=v,  the  sensitivity  — does  not  make  sense.  In  all  calculations 


u/dfm 

involving  — — ,  we  define 


■>  la  .„=v 
jm 


—  \  ^jm  ^Jm 


(6.5.12) 


0,  a/m=v‘ 


Similarly,  the  sensitivity  of  membership  function  to  changes  in  and  Cj,n  is 
derived  as 
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with 


and 


with 


dfj, 


jm  _ 


ajm-^ 


(Pjm  ^jm ) 


(^yiw  ^jm ) 

0, 


^jm  —  V  <  bj/f, 


2  ’  ^jm  <  ^  —  ^jm 


Otherwise 


(6.5.13) 


jm 


dbjm 


bi^=v 

jm 


1 

c  -b  ’ 

yjm  '^jm 

-1 

^jm 

h  • 

^jm  “  ^jm 

(6.5.14) 


dfi 


jm 


bjm  ^ 


(pjm-bjm) 

0, 


2’  bj„t<V<Cjm 

otherwise 


(6.5.15) 


dfj, 


jm 


c .  =v 


0, 

1 


\^jm  ^jm 


Cjm=^ 


»  Cjm=^ 


(6.5.16) 


6.6  Dynamic  Modeling  of  PM  from  Test  Data 

In  the  following,  we  model  the  dynamics  of  a  real  pneumatic  muscle  hanging 
vertically  actuating  a  mass  of  approximately  20kg,  as  in  Figure  2.2*^  We  collected  data 
from  the  Pneumatic  Muscle  Test  Station  in  the  Human  Effectiveness  Lab  at  Wright- 
Patterson  Air  Force  Base.  All  algorithms  in  this  chapter  are  developed  in  Matlab  5.3,  and 
simulations  are  executed  on  a  PC  with  933MHz  PHI  CPU. 

A  sample  of  the  normalized  recorded  input-output  data  is  plotted  in  Figure  6.2.  In 
the  following,  we  model  the  pneumatic  muscle  using  the  recurrent  neuro-fuzzy  inference 
system  presented  in  Section  6.2  and  initiated  via  VISIT. 

We  collected  several  sets  of  input-output  data  from  the  PM  and  used  these  for 
training  and  test  data  for  modeling.  Using  these  data,  a  RNFIS  model  as  in  Figure  6.1 
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was  developed.  In  the  structure  learning  phase,  we  used  VISIT  with  5  =  0.1 ,  w  =  0.85 , 
A,  =  0.27 ,  and  o-q  =  1.17 .  These  parameters  were  chosen  to  yield  a  fiizzy  system  with  a 

relatively  small  number  of  rules  and  fiizzy  sets  on  each  universe  while  giving  relatively 
small  identification  error.  This  provided  a  good  initial  system  structure  to  be  tuned 
further  via  BP. 

In  the  parameter  learning  phase,  we  use  the  BP  algorithm  of  Section  6.5  for  tuning 
model  parameters  (i.e.  centers  and  spreads  of  the  model  membership  functions  identified 
by  VISIT).  After  approximately  20,000  iterations,  the  BP  training  was  ended  at  a  mean 
square  error  of  MSE=0.001 1,  where 

MSE^^j]{y(k)-y(k)f  (6.6.1) 

^  k=\ 

In  (6.6.1),  y{k)  is  the  output  of  the  identified  RNFIS  model,  y{K)  is  the  experimentally 
measured  PM  length  (see  Figure  6.2(a)),  and  K  is  the  number  of  input-output  pairs  used. 
This  tuning  process  yielded  the  following  32  rules  together  with  the  input  membership 
functions  plotted  in  Figure  6.3. 
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Rule-base  for  RNFIS  Model  of  PM 

1.  IfX^-1)  is  mfii  and>’(^-2)  is  mf2i  and  jc(^-2)  is  mfsi  and  Ji:(^-1)  is  mfji  then 
y(k)  =  1.900 

2.  If y{k- 1 )  is  mfn  and  y{k-2)  is  mf2i  and  x{k-2)  is  mf32  and  x{k- 1 )  is  mfn  then 
XA:)=  2.154 

3.  IfX^-1)  is  mfii  andX^-2)  is  mf2i  and  x(k-2)  is  mf32  and  Jc(A^l)  is  nif»2  then 
y(k)  =  1.805 

4.  IfX^-1)  is  nifi2  and  y{k-2)  is  mj^i  and  x{k-2)  is  nif32  and  x(A-l)  is  mff2  then 
y(k)  =  0.160 

5.  IfX^-1)  is  nifi2  and y{k-2)  is  mfzi  and  x(k-2)  is  mf32  and  x(^-l)  is,mft2  then 
X^)  =  2.481 

6.  IfX^-1)  is  mfi2andX^-2)  is  mf22  andx(^-2)  is  nif32  andx(^-l)  is  mfn  then 
yik)  =  2.478 

7.  IfXA^-1)  is  mfi2  andX^"2)  is  mf22  and  x(^-2)  is  mf3i  and  x(A^-l)  is  mfj2  then 
y{k)  =  0.668 

8.  IfX^-1)  is  mfi2  and  X^‘2)  is  mfei  and  x{k-2)  is  mf32  and  xik-Y)  is  mfn  then 
y{k)  =  0.325 

9.  IfX^-1)  is  mfi2  and X^*2)  is  mfe2  and  x(k~2)  is  mf3i  and  x(A^l)  is  mfn  then 
Xit)=  1.001 

1 0.  If y(k- 1 )  is  mfi2  and  X^’2)  is  mf22  and  x(^-2)  is  mf32  and  x(k- 1 )  is  m^3  then 
yik)  =  0.425 

1 1 .  IfX^-1)  is  mfi2  and X^"2)  is  mfzz  and  x(A:-2)  is  mf33  and  x(A^-l)  is  mfn  then 
X^)  =  2.179 

12.  IfX^-1)  is  mfi2andX^-2)  is  mf22  andx(A:-2)  is  mf3i  andx(^-l)  is  mf43  then 

y(k)  =  1.163 

13.  IfX^-1)  is  mfi2  andX^-2)  is  mf22  and  x(k-2)  is  mf33  and  x(A:-l)  is  mft2  then 
y(k)  =  0.461 

14.  IfX^-1)  is  mfi2  andX^-2)  is  mf22  and  x{k-2)  is  mf33  and  x(/:-l)  is  mf>3  then 
y(k)  =  0.876 

15.  IfXA:-l)  is  mfi3  andX^-2)  is  mL^2  and  x(k-2)  is  mf33  and x(^-l)  is  mf(3  then 
yik)  =  0.573 

16.  IfX^-1)  is  nifi3  andX^-2)  is  mf22  andx(A:-2)  is  mf33  andx(^-l)  is  mfnthen 
yik)  =  0.829 

1 7.  IfX^- 1 )  is  nifi3  and  X^-2)  is  mf22  and  x(A:-2)  is  mf3i  and  x(A:- 1)  is  mfi3  then 
Xit)  =  3.594 

18.  IfX^-1)  is  mfi2andX^"2)  is  raSti  andx(^-2)  is  mf3i  andx(^-l)  is  nifj4  then 
Xit)  =  0.611 

19.  IfX^-I)  is  mfi3andX^-2)  is  mf22  andx(A^2)  is  mf33  andx(A:-l)  is  mf42  then 
Xifc)=  1.071 

20.  IfX^-1)  is  mfii  andX^-2)  is  mf22  and  xik-2)  is  mf3i  and x(^-l)  is  mfji  then 
yik)  =  2.271 

21.  IfX^-1)  is  mfi2  andX^-2)  is  mf23  and  x(A:-2)  is  mf32  and x(fc-l)  is  mft4  then 
Xit)=  1.154 
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22.  IfX^-1)  is  mfi3  andj>;(^-2)  is  mfaa  and  x(k-2)  is  mfaa  and  x(^-l)  is  mfta  then 
X^)=  1-225 

23.  IfX^-1)  is  mfi3andX^-2)  is  mf23  andX^-2)  is mf32  andX^-1)  is m£t2  then 
yik)=\.\69 

24.  IfX^'l)  is  nifi2andX^-2)  is  mfes  andx(^-2)  is  mf3i  andx(A^-l)  is  ni£j3  then 
y(k)  =  0.534 

25.  IfX^"  1 )  is  *nfi3  and  X^"2)  is  nif22  and  x(k-2)  is  mf32  and  x(k- 1 )  is  mf42  then 
y(k)  =  0.685 

26.  IfX*-l)  is  nifn  andX^-2)  is  mf2i  and  x(k-2)  is  nrf3i  and  is  mff2  then 
Xit)=  1.841 

27.  IfX^-1)  is  nifi3  andX^-2)  is  mf23  and  x(^-2)  is  nif33  and  x(^-l)  is  m£j3  then 
y(k)  =  0.587 

28.  IfX^-1)  is  mfi3  andX^-2)  is  m^3  and  x(k-2)  is  mf33  and  x(A-l)  is  m£t4  then 
X^)  =  0.439 

29.  IfX^'l)  is  nifi3  andX^"2)  is  mf23  and  x(k-2)  is  nif34  and  x(A:-l)  is  mjEt2  then 
y(k)  =  0.607 

30.  IfXlt-1)  is  mfis  andX^-2)  is  mf23  and  x(^-2)  is  nif32  and  x(^-l)  is  mft4  then 
y(k)  =  0.094 

31.  IfX^“l)  is  nifi3  andX^-2)  is  mJ^3  and  x(k-2)  is  nrf33  and  x(A-l)  is  m£t2  then 
y(k)  =  0.893 

32.  IfX^-1)  is  nifi3  and X^-2)  is  m^3  and  x(A:-2)  is  nif33  and  x(k-l)  is  mfn  then 
yik)  =  2.043 


Figure  6.3  -  Input  membership  functions  for  32-rule  fuzzy  model  of  PM 
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The  model  output  is  compared  with  that  of  the  true  PM  in  Figure  6.4.  The 
sampling  time  in  Figure  6  is  second,  yielding  K  «  3600 .  Obviously  from  Figure 

6.4,  the  model  closely  describes  the  PM  dynamics. 

To  further  verify  the  above  model,  we  used  another  set  of  recorded  data  from  the 
same  pneumatic  muscle  to  test  the  model.  The  results  are  plotted  in  Figure  6.5.  From 
Fig.6.5a,  we  see  that  the  PM  length  smoothly  changed  in  this  data  set,  and  the  above 
obtained  model  closely  describes  this  behavior  with  small  errors  (MSB  =  0.01622). 
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MuscIeLen 


Figure  6.5  -  Model  validation:  (a)  model  and  recorded  outputs,  (b)  model  output 

error 
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7  Evolutionary  Design  of  a  Fuzzy  Classifier  from  Data 

This  chapter  presents  results  that  are  tangential  to  the  main  topic  of  the  project,  i.e.  PM 
control.  However,  since  the  results  grew  directly  out  of  our  research  on  this  project,  we 
present  them  here. 


7.1  Introduction 

Pattern  classification  refers  to  the  problem  of  partitioning  a  feature  space  into 
several  regions  and  categorizing  objects  into  classes  defined  on  these  regions.  The 

problem  of  classifier  design  is  to  find  an  optimal  mapping /from  the  feature  space  i?” 
into  the  decision  space  C,  i.e.  ->C.  There  are  many  ways  to  construct  classifiers, 

such  as  statistical  models  [40],  neural  networks  [41],  and  fuzzy  logic  systems  [42]. 

Fuzzy  logic  has  been  applied  in  several  sophisticated  classification  systems  [43] 
due  to  its  powerful  capabilities  of  handling  uncertainty  and  vagueness.  Fuzzy  logic 
brings  into  classification  systems  the  existence  of  overlapping  classes  and  a  soft  decision 
mechanism.  A  pattern  therefore  can  belong  to  several  classes  with  different  degrees  of 
membership.  Moreover,  fiizzy  classifiers  consist  of  a  set  of  fiizzy  if-then  rules,  which 
provide  insight  into  the  classifier  structure  and  improve  interpretability  [44]. 

Fuzzy  classifiers  such  as  those  above  generally  lead  to  performance  that  is  similar 
to  alternative  techniques  such  as  those  based  on  neural  networks  or  statistics.  Studies 
have  shown  that  neural  networks  and  fuzzy  systems  are  equivalent  and  convertible  ([45]- 
[47]).  The  advantage  of  fuzzy  classifiers  lies  in  their  interpretable  mle-base  stracture. 
The  fiizzy  classification  rules  are  represented  in  linguistic  forms  that  are  easily 
interpreted  and  examined  by  users.  The  interpretability  of  results  is  related  to  the  number 
of  fiizzy  rules.  Fuzzy  systems  containing  few  fiizzy  rules  are  always  more  interpretable 
than  those  with  many  fuzzy  rules.  Moreover,  a  small  rale  base  generally  improves 
performance  by  eliminating  over-fitting,  boosting  generalization  and  enhancing 
robustness. 

In  this  chapter,  a  new  evolutionary  approach  is  proposed  for  deriving  a  contact 
fiizzy  classification  system  directly  from  data  without  any  a  priori  knowledge  of  the 
distribution  of  the  data.  At  the  beginning,  the  fuzzy  classifier  is  entity  with  no  rales  in 
the  rule  base  and  no  membership  functions  assigned  to  fiizzy  variables.  Then,  rules  and 
membership  functions  are  automatically  created  and  optimized  in  an  evolutionary 
process. 

7.2  Fuzjg^  Classifier  Architecture 

The  antecedent  of  a  typical  rale  in  a  fiizzy  classifier  defines  a  decision  region  in  the 
«-dimensional  feature  space;  the  rale  consequent  is  a  class  label  from  the  finite  set 

|Cj  ...  C^j.  Atypical  fuzzy  classification  rale  is 
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Rule l:lf  is  and is and •  •  •  and x^  is  a‘^, then y  is 


where  /  =  1 . X ,  i  is  the  number  of  rules,  w  e  1, , . Af ,  M  is  the  number  of  classes,  n  is 

the  number  of  features,  and  /  (/  =  1, 2, w  )  is  a  fuzzy  set  associated  with  the  feature 

variable  x  ..  Let  A^.  be  characterized  by  the  membership  function  ,{x).  The 
J  J  A.  J 

membership  function  //  ,  can  be  triangular,  Gaussian,  or  any  other  shape.  In  this 

A . 

J 

chapter,  we  consider  asymmetric  Gaussian  membership  functions,  defined  as: 


^2 


X  .-m  . 

J _ J 


jJ 


^2 


X  .-m  . 

J _ J 


J,l 


X  .<m[ 
J  J 


x.>m[ 
J  J 


(7.2.1) 


where  ni.  denotes  the  membership  function  center  and  cr^,  and  cr^.  represent  the  right 

and  left  spreads.  IfJCj^=[xj  x^  ...  x^Jisagivenfeature  vector,  then  using  product  for 
premise  conjunction  yields  the  following  firing  strength  of  rule  : 


(7.2.2) 


The  output  of  the  fuzzy  classifier  is  determined  by  the  rule  with  the  largest  firing 
strength  for  a  given  feature  vector.  That  is,  if  for  pattern  x^  the  fth  rule  has  the  largest 

firing  strength,  will  be  categorized  into  the  class  in  the  consequent  of  that  rule.  The 

classification  error  for  pattern  x^  is  calculated  as 


10  if  x^  is  correctly  classified 
1  if  x^  is  incorrectly  classified 


(7.2.3) 
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7.3  Extracting  Fuzzy  Rules  via  the  VISIT  Algorithm 

The  basic  idea  in  many  fuzzy  learning  systems  is  to  separate  the  data  into  partitions 
by  clustering  techniques,  to  define  the  appropriate  behavior  on  each  partition,  and  then  to 
tune  the  behaviors  and/or  the  areas  of  partitions  via  such  optimization  techniques  as  the 
Recursive  Least  Squares  (RLS),  the  Levenberg-Marquardt  (LM),  and  evolutionary 
algorithms.  The  most  straightforward  and  original  methods  used  in  constructing  fuzzy 
systems  fi'om  data  are  the  Learning  From  Examples  algorithm  (LFE)  [48]  and  the 
Modified  Learning  From  Examples  algorithm  (MLFE)  [49].  The  Variable  Input  Spread 
Inference  Training  algorithm  (VISIT)  [37]  is  similar  to  LFE  and  MLFE  and  is  most 
similar  to  MLFE.  The  basic  idea  of  VISIT  is  that  all  adjacent  membership  fimctions  on  a 
given  universe  of  discourse  cross  at  the  same  level.  VISIT  shows  some  interesting 
features  compared  to  its  peers. 

The  LFE  algorithm  relies  on  a  given  set  of  input  membership  fimctions  that  may  not 
adequately  cover  the  universe  of  discourse  or  accurately  describe  the  nature  of  the  data 
clustering.  If  the  membership  functions  do  not  adequately  cover  the  universe  of 
discourse,  the  classifier  will  not  converge  to  an  acceptable  system  and  adequate 
performance  will  not  be  achievable.  The  rule  base  can  grow  unmanageably  and 
unnecessarily  large  when  the  width  of  the  chosen  membership  functions  is  significantly 
smaller  than  the  relevant  cluster  size  of  the  data. 

The  MLFE  algorithm  chooses  its  membership  function  widths  as  a  function  of 
learning  constants  and  distance  to  the  closest  existing  membership  function  on  the  current 
universe  of  discourse.  The  training  can  be  very  sensitive  with  respect  to  the  learning 
constants.  But  the  symmetrical  nature  of  the  membership  functions  precludes  prudent 
choice  of  membership  function  widths  on  at  least  one  side  of  each  membership  function. 
Each  new  membership  function  tends  to  decrease  in  width  relative  to  earlier  created 
membership  functions.  As  a  consequence,  membership  function  widths  can  differ  fi-om 
each  other  significantly.  With  decreasing  membership  function  widths,  a  rule  e3q)losion 
similar  to  that  of  LFE  with  narrowly  specified  membership  functions  could  be  realized. 

The  VISIT  algorithm  addresses  the  above  problems  by  prudently  choosing  and 
actively  adjusting  the  widths  of  asymmetrical  memibership  fimctions.  For  asymmetrical 
triangular  and  several  other  membership  function  shapes,  by  choosing  appropriate  design 
parameters,  these  membership  functions  can  form  fuzzy  partitions  of  unity.  Because  the 
membership  function  widths  are  created  as  a  function  of  the  training  data,  the  input 
membership  explosion  experienced  by  LFE  when  cluster  size  exceeds  membership  width 
is  significantly  reduced  or  eliminated  completely.  VISIT  adjusts  membership  Action 
widths  on  each  side  of  the  membership  fimction,  based  on  the  distance  to  the  closest 
membership  function  on  either  side  of  Ae  new  membership  function.  At  the  same  time, 
widths  of  existing  nearest  neighbor  memberships  are  recalculated  to  account  for  the 
newly  added  membership  function.  A  result  of  active  adjustment  of  existing  widths  is 
that  the  membership  function  vddths  shrink  at  a  much  lower  rate,  one  that  is  appropriate 
for  a  given  data  set.  The  membership  widths  differ  fi-om  each  other  only  as  indicated  by 
the  nature  of  the  data  set,  rather  than  as  a  function  of  the  training  algorithm. 

Before  starting  VISIT,  we  first  choose  a  vector  of  initial  spreads 
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E  =  [o-j  ...  cr^J  for  the  first  membership  functions  created,  a  vector 

A  =  [oj  02  •••  ^„\  alpha-cut  values,  and  a  vector  W  =  [wj  ...  w^J  of 

parameters  determining  the  degree  of  overlap  between  adjacent  membership  functions  on 
each  universe.  We  now  give  the  VISIT  algorithm  for  pattern  classification. 

VISIT  Algorithm  1371 

1.  Set  /  =  !.  Get  the  first  training  pair  tp^  where  isthe  class  of 

data  pair  1  represented  by  a  real  number.  On  each  input  universe  of  discourse  form 
a  fiizzy  set  A^.  characterized  by  a  symmetrical  Gaussian  membership  function 

with  center  at  ffi^.  Q  =  jc*  and  spread  a..  On  the  output  universe  of 

discourse  form  a  fozzy  set  characterized  by  a  singleton  membership  function 
with  support  . 

2.  Add  the  rule 

If  Xj  is  A^  and  x^  is  A^  and  •••  and  x^  is  ^4*  thenyis  5*. 

3.  If  there  are  no  more  training  pairs,  the  training  cycle  is  completed.  Otherwise, 
increment  i  and  get  the  next  training  pair  tp'  =  (Xj,  x.^,...,x^,  y)' . 

4.  Evaluate  tp'  with  the  current  classifier.  If  classification  is  correct,  go  to  step  3, 
Otherwise,  go  to  step  5. 

5.  On  the yth  input  universe  y  =  1, ..., « ,  evaluate  all  membership  functions  at  x'. .  Call 
the  fuzzy  set  whose  membership  function  is  maximum  Aj^  and  its  a^.-cut  set 

If  Xj€aj-cut^^^^,  form  a  new  fuzzy  set  aJ^  on  the yth 

ftsw  i 

universe  characterized  by  a  membership  function  with  center  m  =x  and  left 

j  J 

and  right  spreads  cr  and  as  in  step  6  below.  On  the  output  universe,  if  the 

pattern  tp‘  belongs  to  a  new  class,  add  a  new  fuzzy  set  B’  characterized  by  a 

singleton  membership  function  with  support  y‘ .  Otherwise,  the  class  B'  already 
exists  fi’om  previous  data. 

6.  On  each  input  universe  on  which  a  new  membership  function  was  added  in  step  5, 
the  iq)read  of  each  side  of  the  new  membership  function  is  initialized  as  o-j . 

However,  if  there  exist  neighboring  memberships  on  the  left  and/or  right  of  the  new 
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one,  the  spread  of  the  corresponding  side  of  the  new  membership  function  and  its 
nearest  neighbors  on  the  left  and/or  right  are  re-calculated  as  follows.  The  right 
spread  of  the  new  membership  function  and  the  left  spread  of  the  nearest  right 
neighbor  are  re-calculated  as 


J  w. 

J 


,^new  ^nRn\ 
\fn .  —m. 

J  J 


(7.3.1) 


The  left  spread  of  the  new  membership  function  and  the  right  spread  of  the  nearest 
left  neighbor  are  re-calculated  as 


L  1 

tJ .  =  — 

J  w. 
J 


,  new  riLn 
\m,  -m, 

J  J 


(7.3.2) 


where  denotes  the  nearest  existing  center  to  the  right  of  and  jw^” 

denotes  the  nearest  existing  center  to  the  left  of  . 

7.  If  a  new  membership  function  was  formed  in  step  6,  consider  the  following 
candidate  rule  where  the  fuzzy  sets  in  the  premise  are  the  ones  inaximized  by  the 
corresponding  inputs: 

If  jCj  is  and  is  and  •••  and  is  thenj;  is  5' 

If  there  is  no  rule  in  the  rule  base  that  is  inconsistent  (i.e.  same  premise,  different 
consequent)  with  this  rule,  add  the  above  rule  in  the  rule  base.  If  there  exists  an 

inconsistent  rule,  discard  (p*  and  go  to  step  3. 

8.  If  there  is  another  training  pattern,  go  to  step  3.  Otherwise,  the  training  process 
ends. 


Comments 

Note  that  the  above  VISIT  algorithm  is  somewhat  different  fi-om  the  original  one  in 
[37].  In  [37],  the  authors  assumed  that  all  universes  of  discourse  were  identical,  hence 
scalar  values  of  w,  a,  and  a  were  chosen  to  begin  the  algorithm.  However,  in  many  real 
systems,  different  featmes  often  work  in  different  universes  of  discourse,  so  we  apply 
tteee  vector  parameters  W,  A,  and  S  for  the  VISIT  algorithm.  Choosing  different 
values  of  W,  A,  and  L  results  in  different  fiizzy  systems  from  the  same  data  set.  The 
main  problem  in  finding  a  good  VISIT  classifier  amounts  to  finding  good  values  of  the 
parameters  W,  A,  and  2.  In  the  original  VISIT,  these  parameters  were  chosen  by  trial 
and  error. 

The  vector  parameter  W  determines  the  degree  of  overlap  between  two  neighboring 
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membership  functions.  Large  w.  results  in  small  overlaps  in  the  yth  input  universe  of 

discourse.  It  can  be  shown  that  two  neighboring  membership  functions  always  overlap  at 
/  8 

e  when  asymmetric  Gaussian  membership  functions  are  used.  Therefore,  the 

resulting  fuzzy  system  will  meet  the  so-called  e-completeness  condition,  i.e.,  there  always 
exists  at  least  one  rule  with  firing  strength  of  at  least  e  for  any  pattern  in  the  operative 
region.  It  can  be  proved  that  the  firing  strength  of  any  fuzzy  rule  generated  by  VISIT  will 

be  no  less  than  ]J[®  • 

/=! 

Parameters  A  determine  when  a  new  rule  is  added  to  the  classifier,  and  £ 
determines  the  initial  spreads  of  the  first  membership  functions.  Larger  a.  and  smaller 

cTj  tend  to  increase  the  number  of  membership  functions  for  the  yth  input,  while  smaller 

a.  and  larger  <t.  tend  to  decrease  the  number  of  membership  functions.  In  some 

J  J 

situations,  if  is  large  enough  and  a.  is  small  enough,  a  membership  function  will  be 


generated  only  for  the  /th  input,  which  implies  feature  reduction  because  the  same 
membership  function  occurs  in  all  rules. 

— /  8 

When  a.<e  ,  a  new  membership  function  is  formed  only  if  the  new  feature 
amoimt  fells  outside  the  Oj  -cut.  We  can  estimate  the  distance  between  two  neighboring 


membership  functions  as: 


distance  >  a .  J- 2  In  a . 

J^j  J 


(7.3.3) 


If  iM  denotes  the  center  of  the  first  membership  function  for  the  Jth.  input,  we  can 


derive  other  membership  function  centers  recursively  from  the  above  equation. 


iw =  min 
y,*  i 


X*.  >»?.,  ,  +cr .  l-llna  . 
J  J,k-l  jyJ  J 


for  membership  function  centers  to  the  right  of  m  and 


(7.3.4) 

(7.3.5) 


(7.3.6) 
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f 


m 


J-k 


=  max 
i 


K 


x\<m . 

J  J-k+l 


-cr . 
J 


.  l-2lna. 

n  j) 


(7.3.7) 


for  membership  function  centers  to  the  left  of  m  .  Obviously,  the  final  fu2zy  system 

depends  on  m  ,  which  is  traditionally  equal  to  the  value  of  the  first  feature  amount  fed 

into  the  learning  machine.  In  other  words,  the  original  VISIT  algorithm  depends  on  the 
sequence  of  patterns  fed  in.  In  the  sequel,  we  ease  this  limitation. 

When  e  <a.<\,  VISIT  will  add  a  new  membership  fimction  between  two 
neighboring  membership  functions  when  a  new  pattern  has  an  input  x.  such  that 


X .  e 
J 


! . a .  /-21na .,  m  . ,  ,  +  c .  l-llna . 
j,l  ;V  J  J^j  J 


where  m .  .and  m..  are  the  centers  of 


two  neighboring  membership  functions  and  ntj  i-  Obviously,  if  Oj  is  large,  this 

can  relax  the  dependence  on  the  sequence  in  which  the  patterns  are  fed  to  the  learning 
mechanism. 


7.4  Extraction  of  Fuziy  Rules  via  Evolutionary  Algorithms 

Consider  the  training  data  set  X  =  \^Xj^  =  \^yX^,—x^,yf  \.  The  single  data  point 

is  an  element  of  the  space  for  each  k.  The  VISIT  algorithm  can  be  considered  as 

defining  a  function  mapping  jp(  )  jfrom  the  training  data  set  X  to  the  space  of  fuzzy 

systems  FIS,  i.e. 

FIS=  F  {X)  (7.4.1) 

W,A,E,M 


Note  that  an  extra  parameter  vector  M  = 


m. 


2,0 


consisting  of  the 


centers  of  the  first  membership  functions  has  been  added.  Parameters  M  can  relax  the 
dependence  of  VISIT  on  the  sequence  in  which  the  pattern  is  fed.  The  process  for  the 
creatiqn  of  a  “good”  fuzzy  classifier  via  VISIT  is  equivalent  to  looking  for  optimal 
parameters  W,  A,  S,  and  M,  where  “goodness”  is  evaluated  by  some  performance  index 
J,  such  as  and  below.  Therefore,  the  problem  of  generating  a  fuzzy  system  from 

data  via  VISIT  can  be  mathematically  expressed  as  a  multi-objective  optimization 
problem: 


min  u) 
W,A,2:.IVf 


(7.4.2) 
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subject  to 


<w  .< 

J 

0<a , <  1 
j 

min  .  ^  .  ^max 

CT  .  <  cr .  <  <7 . 

J  J  J 


X .  <m .  ^  <  X . 

J  y.o  J 


(7.4.3) 

(7.4.4) 

(7.4.5) 

(7.4.6) 


In  (7.4.3-7.4.6), 


max 


defines  the  range  of  acceptable  overlaps,  and 


♦  O'; 


^max 


defines  the  range  of  acceptable  initial  membership  function  spreads  for  the 


fSa.  feature,  and  [jc”'” ,  jc*!™”  ]  is  the /th  imiverse  of  discourse. 

J  is  the  performance  index  for  the  evaluation  of  the  fiizzy  system.  We  often 
evaluate  a  fuz2y  system  in  two  ways: 


1.  Accuracy:  the  fiizzy  system  should  accurately  describe  the  behaviors  embedded  in 
the  data. 

2.  Interpretability:  the  size  of  the  fuzzy  rule-base  should  be  small  enough  to  be  easily 
understood.  Interpretability  is  one  of  the  features  that  distinguish  a  fiizzy  classifier 
from  its  peers.  Moreover,  for  a  fiizzy  system,  good  interpretability  often  implies 
robust  performance  on  unseen  data.  The  interpretability  of  a  fiizzy  system  is 
inversely  related  to  the  number  of  membership  functions,  the  number  of  rules,  and 
premise  length. 

Traditional  optimization  methods  are  based  on  the  assumption  that  the  performance 
index,  or  objective  function,  is  differentiable.  Unfortunately  in  the  above  optimization 
problem,  we  cannot  in  general  assume  the  objective  function  J  is  differentiable  with 
respect  to  parameters  W,A,2:,andM.  One  way  to  overcome  this  problem  is  to  use  a 
genetic  algorithm  (GA)  to  optimize  J .  GAs  provide  powerful  tools  to  efficiently  search 
in  poorly  understood,  irregular  spaces,  and  are  inspired  by  the  mechanism  of  natural 
evolution.  For  some  difficult  optimization  problems,  GAs  can  perform  better  than 
traditional  optimization  techniques,  such  as  hill-climbing  methods,  by  means  of 
exploitation  of  the  best  solutions  and  exploration  of  the  search  space  simultaneously.  The 
genetic  algorithm  used  in  this  chapter  is  described  as  follows. 

Step  1 .  Let  r = 0 .  Initialize  the  first  population  with  A  individuals  (W  even). 

Step  2.  Evaluate  the  fitness  yj.' ,  i  =  1, . . . ,  7/  of  the  W  individuals  in  P^  (fitness  defined 
below). 
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Step  3.  Select  the  NH  fittest  individuals  in  population  for  mutation  (defined  below). 
Mutate  the  NI2  fittest  individuals  to  generate  NH  new  individuals  P' . 

Step  4.  Randomly  select  NIA  pairs  of  individuals  and  apply  a  crossover  operator  on  each 
pair  to  generate  NI2  more  new  individuals  . 

Step  5.  Select  the  N  fittest  individuals  from  the  current  population  and  to 

construct  the  next  population  . 

Step  6.  Let  /  =  r  + 1  and  go  to  step  3.  Repeat  until  t  >  ,  where  is  a  predetermined 

end  time. 

In  Step  5,  the  elitist  selection  method  always  preserves  the  N  best  individuals  in  the 

fth  evolution. 


Solution  Representation 

Chromosomes  are  used  to  describe  individuals  in  the  population.  The  VISIT 
algorithm  provides  an  encoding  method  to  create  chromosomes  for  different  individuals. 


The  vector  P^(/)=  wj (/),•• -.w/C/Xal (/),•••, a' 


eP 


4wxl 


(chromosome)  represents  an  individual  in  our  genetic  algorithm.  In  P(/) ,  t  represents  the 

generation  number  and  /  represents  an  individual  in  the  population.  Each  element  in 
P^{i)  always  corresponds  to  the  same  variable  regardless  of  t.  This  is  necessary  for 

efficient  utilization  of  the  crossover  operators.  Moreover,  the  chromosome  allows  us  to 
initialize  individuals  according  to  constraints,  i.e. 


w.(i)  =  w' 
j 


min 


+  r.^  .(W 

i,J 


a®(0  =  r^. 

J  '  hj 


max 


,  mm  ^ 


O/.x  „min 
O'. (0  =  5. 

J  J 


i,r  J  J 


hj  J  J 


(7.4.7) 

(7.4.8) 

(7.4.9) 
(7.4.10) 


where  rjj,  rfj,  and  r^j  are  random  numbers  uniformly  distributed  in  [0,  1].  This 
initialization  limits  the  search  space,  and  makes  the  optimization  problem  more  efficient. 

Genetic  Operators 

There  are  two  kinds  of  genetic  operators  used  in  the  proposed  genetic  algorithm  - 
whole  arithmetic  crossover  and  Gaussian  mutation.  The  Ath  pair  of  selected  individuals 
/J(i)  and  P^(J)  are  crossed  via  whole  arithmetic  crossover  operators.  The  two  resulting 
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offspring  from  this  pair  are 


P^'(l)=/-P,(/)+(l-r)/;(y) 

Pt2)=rP/y)+(l-r)P(/) 


(7.4.11) 

(7.4.12) 


where  r  is  a  random  number  uniformly  distributed  in  [0,  1].  As  stated  in  step  4  above, 
this  process  is  followed  for  7V74  randomly  chosen  pairs  of  individuals. 

We  define  the  following  mutation  operator  to  create  a  new  individual  from  the  ith 
individual  in  the  rth  generation  P^{i)  (step  3  above): 


f 

=  PfiiJ)  +  rjPj  (Oexp 

V 


■^max  J 


(7.4.13) 


where  j  signifies  a  particular  element  of  the  chromosome,  Yj  is  a  learning  coefficient 

which  determines  the  amount  of  maximum  change  of  the  yth  element  in  the  chromosome 
from  generation  to  generation,  yo^.(/)  is  a  uniformly  distributed  random  number  g  [-1,  1], 

f*  is  the  fitness  of  individual  i  in  the  fth  generation,  and  is  the  largest  fitness  in  the 
rth  generation. 

Note  that  some  infeasible  offspring  can  be  created  via  the  mutation  operator 
(7.4.13),  so  we  have  to  examine  each  individual  offspring  to  insure  it  meets  the 
constraints  of  (7.4.3-7.4.6).  If  an  offspring  is  infeasible  we  discard  it,  decrease  yj  by 

half,  and  apply  the  mutation  operator  again  until  a  feasible  offspring  is  obtained.  Another 
simple  method  to  generate  a  feasible  offspring  and  replace  the  infeasible  one  is  to  use 
initialization  equations  (7.4.7-7.4.10). 

Evaluation  of  Individual  Fitness  via  a  Fuzzy  Expert  System 

In  the  above  evolutionary  algorithm,  the  fitness  function  used  to  evaluate  the 
goodness  of  each  individual  is  critical  to  the  selection  operation  of  the  evolutionary 
process.  For  fuzzy  systems,  we  care  about  resolution  and  interpretability,  which  are 
conflicting  requirements.  The  resolution  of  the  fuz2y  classifier  is  related  to  the  number 
of  misclassifications.  Many  misclassifications  indicate  low  resolution.  The 
interpretability  of  the  fuzzy  system  is  related  to  the  number  of  rules  in  the  rule-base. 
Many  rules  impair  interpretability.  In  general,  we  can  separately  use  the  Mean 
Classification  Error  (MCE)  for  measuring  the  resolution  of  the  fuzzy  classifier,  and  the 
number  of  rules  as  an  inverse  indicator  of  interpretability.  The  MCE  criterion  is  as 
follows: 
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(7.4.14) 


where  is  defined  in  (7.2.3)  and  ^  is  the  number  of  data  points  in  the  training  set. 

Interpretability  of  the  fuzzy  system  can  be  inversely  measured  by: 

J^=PiL-M)  (7.4.15) 

where  L  is  the  number  of  rules  and  M  is  the  total  number  of  classes  in  the  training  set, 
which  suggests  the  minimum  number  of  rules  in  the  fuzzy  system.  The  parameter  yff  is  a 
factor  to  normalize  the  possible  size  of  rule  base  into  the  range  [0,  1].  In  this  chapter^  we 
use  fi=0.0\ . 

The  objective  functions  Jy  and  J2  always  lie  between  0  and  1.  A  good  fuzzy 
classifier  should  have  small  MCE  and  few  rules.  However,  a  decrease  in  MCE  does  not 
imply  a  decrease  in  the  number  of  rules,  and  vice  versa.  Usually,  resolution  is  improved 
by  increasing  the  number  of  rules.  Thus  it  is  necessary  to  seek  a  trade-off  between 
resolution  and  interpretability  in  the  design  of  fuzzy  classifiers.  In  this  chapter,  the  task 
of  designing  a  fiizzy  system  is  formulated  as  the  following  two-objective  optimization 
problem: 


min 

W,A,2:,M 


min 

W,A,E,M 


(7.4.16) 


An  important  technique  for  solving  a  multi-objective  optimization  problem  is  to 
combine  multiple  objectives  into  one,  reducing  the  complex  problem  into  a  standard 
single-objective  one.  A  well-known  tradeoff  of  multiple  objectives  is  to  define  a  single 
objective  function  that  combines  both  Jj  and  J2  as 


(7.4.17) 


where  0  <  7  <  1 .  With  this  objective  function,  a  fitness  function  fit  could  be  defined  as 


Possible  difficulties  with  using  this  method  are: 


1.  The  result  can  vary  significantly  as  tj  changes.  It  is  difficult  to  determine  an 
appropriate  value  of  rj  when  we  do  not  know  enough  information  about  the 
problem.  In  general,  we  have  to  design  rj  via  trial  and  error,  which  is  often  time- 
consuming. 

2.  Due  to  the  complexity  of  the  classification  problem,  the  factor  77  does  not  reliably 
reflect  the  relative  importance  of  objectives.  It  is  only  a  factor  which,  when  varied, 
changes  the  result.  The  result  is  also  dependent  on  the  units  in  which  the  objective 
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functions  are  expressed.  In  practice,  objectives  often  have  different  importance.  If 
we  want  7  to  reflect  the  relative  importance  of  the  objectives,  all  objectives  should 
be  normalized  in  units  of  approximately  the  same  numeric  values. 

For  the  above  reasons,  we  do  not  use  (7.4.17),  but  instead  apply  a  fiizzy  expert 
system  for  comprehensively  evaluating  the  above  two  objectives  for  individuals  in 
generations  according  to  expert  knowledge.  The  fuzzy  expert  system  consists  of  two 
inputs  (Jj  and  J^),  one  output  (fit),  and  eleven  T-S  fuzzy  rules.  The  output 

recommended  by  rule  k  in  the  fuzzy  ejqpert  system  is  .  Table  1  lists  the  rules  in  the 

fuzzy  expert  system,  and  Figure  1  shows  the  inputs’  membership  functions.  The  rule 
base  and  membership  functions  are  the  result  of  our  general  knowledge  on  a  “good” 
fuzzy  system. 


Table  7.1  -  Fitness  rules 


Rule  # 

Membership  Fimction 

(k) 

Ji 

J2 

1 

Zero 

Zero 

80 

2 

Zero 

Small 

70 

3 

Zero 

Medium 

50 

4 

Small 

Zero 

60 

5 

Small 

Small 

40 

6 

Small 

Medium 

20 

7 

Medium 

Zero 

30 

8 

Medium 

Small 

10 

9 

Medium 

Medium 

1 

10 

— 

Large 

-20 

11 

Large 

— 

-20 
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Figure  7.1  -  Input  membership  functions  for  fuzzy  expert  system 

The  outpxit  of  the  fuzzy  expert  system,  global  fitness,  is  calculated  as  a  weighted 
average  of  the  local  fitness  functions: 

11 

=  -  (7.4.18) 

jfc=i 


where  is  a  local  fitness  taken  from  the  last  column  of  Table  1  and  is  the  firing 
strength  of  the  Ath  mle  (7.2.2).  The  fitness  surface  is  plotted  in  Figure  2. 
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7.5  Applications  and  Performance  Evaluation 

This  section  will  evaluate  the  performance  of  the  proposed  algorithm  on  four 
benchmark  data  sets  in  classification,  the  iris  data,  the  wine  data,  the  Wisconsin  breast 
cancer  data,  and  the  Pima  Indians  diabetes  data.  These  data  sets  can  be  obtained  from  the 
machine-learning  database  at  University  of  California,  Irvine  via  an  anonymous  ftp  server 
(ftp://ftp.ics.uci.edu/pub/machine-leaming-databases/).  It  is  to  be  noted  that  we  corrected 
the  errors  in  the  UCI  iris  data  [50],  All  data  sets  are  normalized  to  the  range  [0, 1]. 

Examnlel  -  Wine  data 

The  wine  data  contains  the  chemical  analysis  of  178  vdnes  grown  in  the  same 
region  in  Italy  but  derived  from  three  different  cultivars.  Thirteen  continuous  attributes 
are  measured  on  each  wine:  alcohol  content  (AL),  Malic  Acid  content  (MAC),  ash 
content,  alcalinity  of  ash  (AA),  magnesium  content  (MA),  total  phenols  (TP),  flavanoids 
(FL),  nonflavanoids  phenols  (NFP),  proanthocyaninsm  (PR),  color  intensity  (Cl),  hue, 
OD280/OD315  (O)  of  diluted  wines,  and  praline  (P).  The  numbers  of  patterns  m  three 
classes  are  59,  71  and  48  respectively. 

The  wine  data  has  been  widely  used  to  test  the  performance  of  classifier  systems. 
Table  7.5  shows  the  results  of  some  well-known  classifier  systems.  Setnes  et  al.  [51] 
applied  a  real-coded  GA  and  a  c-means  clustering  algorithm  on  all  178  patterns  to  design 
a  TSK-type  fuzzy  classifier  system.  In  their  method,  the  c-means  clustering  technique 
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was  used  to  generate  an  initial  system  with  three  rules.  After  repetitive  simplification  and 
optimization,  only  9  features  were  selected  and  generated  a  fuzzy  classifier  system  with 
only  three  rules  and  a  high  recognition  rate  of  98.3%  (3  misclassifications). 

Ishibuchi  et  al.  [52]  applied  all  178  samples  to  the  design  of  a  fiizzy  classifier  via 
three-objective  genetic-based  machine  learning  techniques.  At  the  beginning,  there  are 
1834  candidate  rules  in  the  rule  base.  The  obtained  best  result  is  a  fuzzy  system 
containing  6  rules  and  9  term  sets  with  a  classification  rate  of  100%.  In  their  recent  work 
[82],  they  update  their  algorithm  by  integration  of  an  evolutionary  multi-objective 
algorithm  (NSGA-II  [83]),  and  a  smaller  fuzzy  system  was  designed  with  100  %  correct 
classification.  This  fuzzy  system  only  contained  4  fuzzy  rules  and  5  term  sets. 

Wang  et  al.  [53]  dso  used  all  178  patterns  to  design  their  fuzzy  classifier.  They 
applied  the  Mapping-constraint  Agglomerative  Clustering  (MAC)  method  to  detect  the 
structure  of  the  fozzy  system,  and  improved  the  fuzzy  classifier  via  a  parameter  learning 
method.  Finally,  several  3-rule  classifier  systems  on  13  features  were  derived  with  the 
best  recognition  rate  of  99.4%  (1  misclassification). 

Roubos  et  al.  [54]  proposed  an  iterative  approach  to  design  a  fuzzy  classifier  system 
using  all  178  patterns.  To  begin  their  algorith^  they  built  a  3-rule  ^izzy  system  using  a 
total  of  13  features,  then  iteratively  applied  feature  selection,  rule  base  simplification  and 
GA  based  parameter  optimization  to  improve  the  performance  and  reduce  the 
dimensionality  of  the  classifiers.  Finally,  they  developed  three  5-feature  fuzzy  classifiers. 
The  classification  results  of  the  three  classifiers  are  98.9%,  99.4%,  and  98.3%  correct. 


Rule  Al  MAC 

1  1  1 

2  2  1 

3  2  1 

4  2  1 

5  2  1 


Table  7.2  -  VISIT  classification  rules  for  wine  data _ 

Ash  AA  MA  TP  FP  NFP  PR  Cl  Hue  O  P  Class 

1  1  111111111  1 

2  1  112111122  2 

1  1  111111212  2 

2  1  1  1  1  1  1  1  3  2  2  2 

1  1  1121113223 
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PralinG 


Figure  7.3  -  VISIT  membership  functions  for  13  features 

Similarly,  we  also  used  all  178  data  points  to  design  our  fuzzy  classifier  system. 
But  differently  from  the  above  methods,  we  do  not  apply  any  clustering  techniques  to 
detect  initial  model  structures.  At  the  beginning,  the  frizzy  system  is  empty  without  any 
rules,  and  the  proposed  method  is  able  to  automatically  extract  and  optimize  a  fuzzy  rule 
base  based  on  available  patterns. 

The  VISIT  algorithm  is  applied  to  create  a  frizzy  system  from  the  wine  data,  and  its 

parameters  W  =  [wj,W2,...,Wj2j,  A  =  =  and 

M=  ,  are  optimized  by  the  evolutionary  algorithm  described  in 

Section  7.4.  The  value  of  parameters  W,  A,  E,  and  M  were  randomly  initialized 
according  to  (7.4.7-7.4.10).  The  population  size  in  each  generation  is  200,  and  the 
evolutionary  process  is  stopped  after  100  iterations.  The  classification  rate  for  the 
resulting  frizzy  system  is  98.9%  correct  (2  misclassifications).  Figure  7.3  shows  the 
membership  functions  of  the  13  features,  and  Table  7.2  lists  all  fuzzy  rules.  Table  7.3 
shows  the  classification  performance  using  VISIT.  The  frizzy  system  can  accurately 
recognize  Class  3  with  no  errors. 
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Table  7.3  -  VISIT  classification  on  3  classes 


Classl 

Class2 

Class3 

Total 

Total  Patterns 

59 

71 

48 

178 

Misclassifications 

1 

1 

0 

2 

Note  that  the  proposed  algorithm  creates  only  one  membership  fonction  for  7  of  the 
features:  malic  acid,  alcalinity  of  ash,  magnesium,  total  phenols,  nonflavanoids  phenols, 
proanthocyaninsm,  and  color  intensity.  Obviously,  these  7  features  are  redundant  to  the 
obtained  classifier  system  because  they  contribute  the  same  to  all  rules.  After  deleting 
these  redundant  features,  the  membership  functions  and  rule  set  of  the  final  system  is 
shown  in  Figure  7.4  and  Table  7.4. 

The  proposed  algorithm  developed  a  compact  fuzzy  system  without  any  assumption 
on  the  structure.  Moreover,  the  classification  rules  are  highly  interpretable,  utilizing  only 
six  features,  five  rules  and  no  more  than  three  membership  functions  associated  with  each 
feature.  Table  7.5  shows  a  comparison  of  our  obtained  classifier  with  the  above  well- 
known  classifier  systems.  Table  7.5  compares  the  various  methods  on  the  number  of 
used  features,  the  number  of  term  sets,  the  number  of  rules,  and  the  recognition  rate. 


Hue  OD280/OD315  Praline 

Figure  7.4  -  VISIT  membership  functions  for  6  features 
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Table  7.4  -  VISIT  classification  niles  for  wine  data  (6  features) 


Rule 

A1 

Ash 

FP 

Hue 

0 

P 

Class 

1 

1 

1 

1 

1 

1 

1 

1 

2 

2 

2 

2 

1 

2 

2 

2 

3 

2 

1 

1 

2 

1 

2 

2 

4 

2 

2 

1 

3 

2 

2 

2 

5 

2 

1 

2 

3 

2 

2 

3 

Table  7.5  -  Comparison  of  results  for  wine  data 

Features  Term  sets 

Rules 

Recognition  rate  on 
total  data  set  (%) 

Setnes  et  al.  [51] 

9  21 

3 

98.3 

Wang  et  al.  [53] 

13  34 

3 

99.4 

Ishibuchi  et  al.  [52] 

9 

6 

100 

Ishibuchi  et  al.  [82] 

5 

4 

100 

Roubos  et  al.  [54] 

5  15,11,10 

3 

98.9, 98.3,  99.4 

This  chapter 

6  13 

5 

98.9 

For  more  reliable  performance  evaluation  of  the  proposed  algorithm,  we  ran  it 
independently  on  the  wine  data  ten  times,  with  the  random  initializations  of  parameters 
W,  A,  E,  and  M  on  each  run.  Table  7.6  reports  the  results  of  ten  trials.  The  resultant 
system  utilized  an  average  of  6.7  feature  variables,  16.7  membership  functions  and  6 
fuzzy  rules  to  get  to  an  average  recognition  rate  of  98.2%  (3.3  misclassiflcations).  The 
best  recognition  rate  is  98.9%  (2  misclassifications).  The  “best”  system  utilized  6  feature 
variables,  14  membership  functions  and  5  fuzzy  rules.  The  worst  recognition  rate  is 
97,8%  (4  misclassifications).  The  related  system  utilized  7  feature  variables,  21 
membership  functions  and  7  rules. 


Table  7.6  -  Results  of  ten  runs  on  wine  data 


1 

2 

3 

4 

5 

6 

7 

8 

9 

Avg 

Misclassifications 

2 

4 

4 

3 

3 

4 

3 

3 

4 

3 

3.3 

Recognition  rate  (%) 

98.9 

97.8 

97.8 

98.3 

98.3 

97.8 

98.3 

98.3 

97.8 

98.3 

98.2 

Number  of  features* 

6 

7 

8 

7 

6 

6 

6 

6 

5 

6 

6.3 

Number  of  rules 

5 

7 

7 

8 

5 

5 

5 

7 

6 

5 

6 

Number  of  term  sets 

14 

21 

23 

18 

14 

14 

•14 

15 

13 

13 

15.9 

*  The  number  of  features  used  in  the  obtained  fiizzy  system  after  deleting  some  one- 
membership-function  features. 


Example  2  -  Iris  Data 

The  Fisher  iris  data  [50]  consist  of  4  measurements:  sepal  length  (SL),  sepal  width 
(SW),  petal  length  (PL)  and  petal  width  (PW),  in  150  data  sets.  Three  iris  species  are 
involved  in  the  measurement:  iris  setosa,  iris  versicolor,  and  iris  virgincia.  Each  species 
contains  50  samples. 
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The  iris  data  is  a  famous  benchmark  to  test  the  performance  of  classifier  systems. 
Table  7.9  shows  the  results  of  some  well-known  classifier  systems.  For  example,  Shi  et 
al.  [55]  applied  an  integer-code  genetic  algorithm  to  learn  a  Mamdani-type  fuzzy  system 
for  classifying  the  iris  data  by  training  on  all  150  patterns.  After  several  trials  with 
different  learning  options,  a  four-rule  fuzzy  system  was  obtained  with  98%  correct 
recognition  (3  misclassifications).  Abe  et  al.  [56]  discussed  a  fuzzy  classifier  with 
ellipsoidal  regions.  They  applied  clustering  techniques  to  extract  fuzzy  rules,  with  one 
rule  around  one  cluster  center,  and  then  they  tuned  the  slopes  of  their  membership 
functions  to  obtain  a  high  recognition  rate.  Finally,  they  obtained  a  fuzzy  classifier  with 
a  recognition  rate  of  98.7%  (2  misclassifications).  Russo  [57]  applied  a  hybrid  GA 
neuro-fiizzy  approach  to  learn  a  fuzzy  model  for  the  iris  data.  He  derived  a  five-rule 
fuzzy  system  with  18  fuzzy  sets  and  0  misclassifications.  Ishibuchi  et  al.  [84]  applied  all 
150  samples  in  the  training  process,  and  derived  a  fuzzy  classifier  with  7  term  sets  and  5 
rules.  The  resolution  was  98.0%  correct  and  3  misclassifications.  Aboyi  et  al  [85] 
proposed  a  new  data-driven  method  to  design  compact  fiizzy  classifiers  via  combining  a 
genetic  algorithm,  a  decision-tree  initialization,  and  a  similarity-driven  rule  reduction 
technique.  The  final  system  had  3  fuzzy  rules  and  4  term  sets.  The  accuracy  is  96.1 1% 
correct  (6  misclassifications). 


Table  7.7  -  Results  of  ten  runs  of  VISIT  on  iris  data 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

Average 

Misclassifications 

1 

3 

1 

2 

1 

2 

2 

2 

1 

2 

1.7 

Recognition  rate 
(%) 

99.3 

98 

99.3 

98.7 

99.3 

98.7 

98.7 

98.7 

99.3 

98.7 

98.87 

Features 

3 

2 

4 

3 

4 

4 

3 

4 

3 

3 

3.3 

Rules 

5 

4 

5 

5 

5 

6 

5 

5 

6 

6 

5.2 

Term  sets 

7 

6 

9 

9 

8 

9 

8 

9 

11 

8 

8.4 

For  the  iris  example,  we  also  used  150  patterns  to  design  a  fuzzy  classifier  system 
via  the  proposed  evolutionary  algorithm.  The  parameters  W,  A,  E,  and  M  were 
initialized  randomly.  The  population  had  200  individuals,  and  the  evolution  processes 
was  stopped  after  100  iterations.  Table  7.7  shows  the  results  of  ten  runs  (i.e.  ten  different 
initializations  of  parameters).  The  final  fuzzy  systems  utilized  an  average  of  3.3  feature 
variables,  8.4  membership  functions  and  5.2  rules  to  get  an  average  recognition  rate  of 
98.87%  (1.7  misclassifications).  The  worst  classification  result  is  98%  correct  (3 
misclassifications).  The  “worst”  system  utilized  2  feature  variables,  6  membership 
functions  and  4  rules.  The  best  classification  result  is  99.3%  correct  (1  misclassification). 
It  misclassified  a  pattern  in  class  Versicolor  into  class  Virginica.  The  corresponding 
“best”  system  utilized  3  feature  variables,  7  membership  functions  and  5  fuzzy  rules. 
Figure  5  shows  the  membership  functions  in  the  best  fii^  classifier.  Table  7.8  shows 
the  fuzzy  rules  used  in  the  best  fiizzy  classifier.  The  obtained  fiizzy  system  is  easily 
inteipreted. 
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SW  PL 


PW 

Figure  7.5  -  VISIT  membership  ftmctions  for  iris  data 


Table  7.8  -  VISIT  classification  rules  for  iris  data 


Rule 

SL  SW 

PL 

PW 

Class 

1 

1 

1 

1 

Setosa 

2 

1 

2 

1 

Versicolor 

3 

2 

2 

1 

Versicolor 

4 

1 

3 

2 

Virginica 

5 

2 

2 

2 

Virginica 

Table  7.9  shows  the  comparison  of  results  between  the  above  fuzzy  classifier 
system  with  other  well-known  classifier  systems  on  total  term  sets,  number  of  rules  and 
recognition  rate.  The  resulting  system  arrives  at  the  highest  degree  of  accuracy  using  the 
smallest  number  of  term  sets. 


Table  7.9  -  Comparison  of  results  on  iris  data 


Term  sets 

Rules 

Recognition  rate  on 
total  data  set  (%) 

Wang  et  al.  [53] 

11 

3 

97.5 

Wu  et  al.  [58] 

9 

3 

96.2 

Shi  et  al.  [55] 

12 

4 

98.0 

Abe  et  al.  [56] 

— 

3 

98.7 

Setnes  et  d.  [51] 

8  or  12 

2or  3 

99.3  or  98.9 

Russo  [57] 

18 

5 

100 

This  chapter 

7 

5 

99.3 

Ishibuchi  et  al.  [84] 

7 

5 

98.0 

Aboyi  et  al  [85] 

4 

3 

98.0 
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To  estimate  the  performance  of  the  proposed  method  on  unseen  data,  the  5-fold 
cross-validation  and  the  Leave-one-out  experiments  were  performed  on  the  iris  data.  In 
the  5 -fold  cross-validation  experiment,  the  normalized  iris  data  were  divided  into  5 
disjoint  groups  containing  30  different  patterns  each,  with  10  patterns  belonging  to  each 
class.  Then  we  derived  fuzzy  systems  via  the  proposed  method  on  all  data  outside  one 
group  and  tested  the  resulting  fuzzy  classifier  on  the  data  inside  that  group.  Finally,  five 
fuzzy  systems  were  derived.  Table  7.10  reports  the  results  of  5-fold  cross  validation. 
The  average  classification  result  is  98.8%  correct  (about  1.4  misclassifications)  on  the 
training  data  and  98.0%  correct  (about  0.6  misclassification)  on  the  test  data  using  4.4 
rules. 


Table  7.10  -  VISIT  5-fold  cross-validation 


1 

2 

3 

4 

5 

Avg 

Rules 

6 

5 

4 

3 

4 

4.4 

Training  patterns 

120 

120 

120 

120 

120  120 

Misclassifications  (training) 

1 

1 

2 

1 

2 

1.4 

Recognition  rate  (training) 
(%) 

99.2 

99.2 

98.3 

99.2 

98.3  98.8 

Testing  patterns 

30 

30 

30 

30 

30 

30 

Misclassifications  (testing) 

0 

0 

1 

1 

1 

0.6 

Recognition  rate  (testing) 
(%) 

100 

100 

96.7 

96.7 

96.7  98.0 

In  the  Leave-one-out  ejqieriment,  we  iteratively  apply  the  proposed  method  on  the  iris 
data  set  150  times.  Every  time,  we  select  a  pattern  as  the  testing  set,  and  other  149 
patterns  as  the  training  set.  Table  7.11  contains  the  information  of  the  derived  fuzzy 
systems,  as  well  as  the  corresponding  training  and  testing  errors.  The  last  row  in  Table 
7.1 1  demonstrates  that  the  average  result  over  150  iterations  is  98.0  %  correct  on  the  test 
set  and  98.4%  correct  on  the  training  set.  This  accuracy  was  obtained  with  an  average  of 
4.75  fuzzy  rules,  1.35  membership  functions  on  SW,  1.43  mfs  on  SL,  3.17  mfs  on  PL  and 
3.03mfeonPW. 


Table  7.1 1-  VISIT  leave-one-out  cross-validation  on  Iris  data 


Recognition  Recognition 
Rate  %  Rate  % 

#  Rules 

SL 

SW 

PL 

PW 

(Training) 

(Testing) 

1 

5.00 

1.00 

1.00 

4.00 

4.00 

98.0 

100 

2 

4.00 

2.00 

1.00 

2.00 

3.00 

98.0 

100 

3 

4.00 

1.00 

2.00 

2.00 

3.00 

98.0 

100 

4 

6.00 

2.00 

2.00 

4.00 

4.00 

98.7 

100 

5 

5.00 

2.00 

2.00 

3.00 

2.00 

98.7 

100 

6 

4.00 

1.00 

2.00 

2.00 

3.00 

98.0 

100 

104 


7 

4.00 

1.00 

2.00 

2.00 

3.00 

98.0 

100 

8 

4.00 

2.00 

1.00 

2.00 

3.00 

98.0 

100 

9 

6.00 

2.00 

2.00 

4.00 

4.00 

99.3 

100 

10 

5.00 

2.00 

2.00 

3.00 

2.00 

99.3 

100 

11 

4.00 

1.00 

1.00 

3.00 

2.00 

98.0 

100 

12 

5.00 

2.00 

2.00 

2.00 

3.00 

99.3 

100 

13 

6.00 

2.00 

1.00 

4.00 

4.00 

98.7 

100 

14 

5.00 

1.00 

1.00 

4.00 

4.00 

98.7 

100 

15 

5.00 

2.00 

1.00 

3.00 

2.00 

98.7 

100 

16 

5.00 

2.00 

2.00 

3.00 

2.00 

99.3 

100 

17 

5.00 

1.00 

1.00 

4.00 

4.00 

98.0 

100 

18 

4.00 

1.00 

1.00 

3.00 

3.00 

98.0 

100 

19 

4.00 

1.00 

1.00 

3.00 

2.00 

98.0 

100 

20 

5.00 

1.00 

1.00 

4.00 

4.00 

98.0 

100 

21 

5.00 

1.00' 

1.00 

4.00 

4.00 

98.0 

100 

22 

4.00 

1.00 

2.00 

2.00 

3.00 

98.0 

100 

23 

4.00 

1.00 

2.00 

2.00 

3.00 

98.0 

100 

24 

5.00 

1.00 

2.00 

3.00 

2.00 

99.3 

100 

25 

5.00 

1.00 

1.00 

4.00 

4.00 

98.0 

100 

26 

4.00 

1.00 

2.00 

2.00 

3.00 

98.0 

100 

27 

5.00 

1.00 

1.00 

4.00 

4.00 

98.0 

100 

28 

6.00 

1.00 

2.00 

4.00 

4.00 

99.3 

100 

29 

4.00 

2.00 

1.00 

2.00 

3.00 

98.0 

100 

30 

4.00 

1.00 

1.00 

3.00 

3.00 

98.0 

100 

31 

4.00 

1.00 

1.00 

3.00 

3.00 

98.0 

100 

32 

5.00 

1.00 

1.00 

4.00 

4.00 

98.0 

100 

33 

5.00 

1.00 

2.00 

3.00 

2.00 

98.7 

100 

34 

4.00 

1.00 

1.00 

3.00 

3.00 

98.0 
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F.Tcamnle  3  -  Wisconsin  Breast  Cancer  Data 

The  Wisconsin  breast  cancer  data  contains  699  patterns  for  two  cancer  states, 
“benign”  and  “malignant.”  The  nine  features  involved  are  clump  thickness  (CT), 
uniformity  of  cell  size  (UC),  uniformity  of  cell  shape  (UCS),  marginal  adhesion  (MA), 
single  epithelial  cell  size  (SECS),  bare  nuclei  (BN)»  bland  chromatin  (BC),  normal  nuclei 
(NT^  and  mitoses  (MI).  A  total  of  683  patterns  are  used  to  evaluate  the  performance 
tecause  the  other  16  patterns  contain  missing  features.  Of  683  valid  patterns,  444 
patterns  pertain  to  benign  class,  and  239  patterns  to  malignant  class.  The  proposed 
method  is  applied  to  design  a  fuzzy  classifier  for  the  Wisconsin  breast  cancer  data.  The 
population  size  was  200.  To  compare  the  performance  with  other  classifiers,  the  first  400 
patterns  are  used  as  the  training  set  in  the  simulation,  and  the  other  283  patterns  as  the 
testing  set.  After  100  iterations,  the  evolutionary  process  was  terminated. 
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Uniformity  of  Cell  Shape  Normal  Nucleoli 


Figure  7.6  -  VISIT  membership  fimctions  for  Wisconsin  breast  cancer  data 


Table  7.12  —  VISIT  classification  rules  for  Wisconsin  breast  cancer  data 


CT 

UC  UCS  MA  SECS 

BN  BC  NN  MI 

Class 

1 

1 

1 

1 

2 

2 

1 

2 

3 

1 

2 

2 

The  proposed  algorithm  created  only  a  single  membership  function  for  features 
clump  thickness  (CT),  uniformity  of  cell  size  (UC),  marginal  adhesion  (MA),  single 
epithelial  cell  size  (SECS),  bare  nuclei  (BN),  bland  chromatin  (BC)  and  mitoses  (MI). 
After  deleting  these  redundant  features,  the  resulting  fuzzy  system  is  quite  simple  with 
only  two  input  features  and  three  rules.  Figure  7.6  shows  the  membership  functions  for 
features  UCS  and  MN.  Table  7.12  shows  the  three  fuzzy  rules.  The  classifier  system 
generates  14  misclassifications  (97.5%  correct)  on  400  training  patterns,  and  10 
misclassifications  (96.5%  correct)  on  283  testing  patterns.  This  leads  to  a  recognition 
rate  of  96.5%  (24  misclassifications)  on  the  total  data  set. 

Table  7.13  shows  a  comparison  between  the  proposed  method  and  other  published 
models.  Wang  and  Lee  [53]  applied  half  of  the  total  683  data  sets  as  the  training  set  and 
the  other  half  as  the  testing  data.  They  developed  a  2-rule  and  18-membership-function 
classifier  system  with  a  recognition  rate  of  96.3  %  (about  25  misclassifications)  in  10 
trials  performed  on  10  different  training  sets.  Nauck  et  aL  [59]  applied  all  patterns  to 
training  their  fuzzy  classifier.  They  initialized  the  NEFCLASS  system  with  3  fuzzy  rules 
obtained  by  a  modified  GK  clustering  algorithm  called  “GK  parallel.”  After  80  epochs  of 
training,  the  classification  rate  was  improved  to  92.7%  correct  (50  misclassifications). 
Ishibuchi  et  al  [82]  also  applied  ten-fold  cross  validation  procedures  on  the  breast  cancer 
dataset  to  test  their  NSGA-II  based  three-objective  genetic  rule  selection  method.  The 
best  result  from  this  method  is  a  fiizzy  system  with  4  rules  of  average  length  1.5.  The 
classification  rate  for  this  system  is  97.6%  correct  (15  misclassifications)  on  the  training 
data  and  95.6%  (3  misclassifications)  correct  on  the  test  data.  In  [85],  Aboyi  et  al  also 
utilized  their  data-driven  method  to  design  fuzzy  classifiers  for  classifying  cancer  data. 
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The  best  result  is  one  with  2  rules,  3  term  sets,  and  96.5%  accuracy  (24 
misclassifications). 


Table  7.13  -  Comparison  results  on  Wisconsin  breast  cancer  data 


Term  sets 

Rules 

Recognition  rate  on 
total  data  set  (%) 

Wang  et  al.  [53] 

18 

2 

96.3 

Nauck  et  al.  [59] 

27 

3 

92.7 

Ishibuchi  et  al  [82] 

>4* 

4 

97.4* 

Aboyi  et  al  [85] 

3 

2 

96.5 

This  chapter 

4 

3 

96.5 

*  These  data  are  estimated  via  the  information  from  [82] 


F.Tamnle  4  -  Pima  Indians  Diabetes  Data 

The  Pima  Indian  diabetes  data  set  contains  768  patterns  with  8  features.  This  data 
set  has  significant  class  overlap,  hence  classification  is  somewhat  more  difidcult  in  this 
case.  The  dataset  is  divided  into  two  classes,  indicating  whether  a  Pima  Indian  individual 
is  diabetes  positive  or  negative.  The  features  are  based  on  measurements  of  Number  of 
Times  Pregnant  (NTP),  Plasma  Glucose  Concentration  (PGC),  Diastolic  Blood  Pressure 
(DBP),  Triceps  Skin  Fold  Thickness  (TSFT),  2-Hour  Serum  Insulin  (2HSI),  Body  Mass 
Index  (BMI),  Diabetes  Pedigree  Function  (DPF),  and  Age  (AGE). 


Plasma  Glucose  Concentration  Body  Mass  Index 


Figure  7.7  -  VISIT  membership  functions  for  Pima  data 


In  our  experiment,  the  first  400  data  pairs  are  used  for  the  training  set,  and  the  other 
368  data  pairs  are  used  for  testing.  The  population  size  was  200,  and  the  training  process 
is  stopped  after  100  iterations.  After  deleting  redimdant  features,  the  resulting  fijzzy 
system  is  compact  and  simple,  having  only  two  inputs,  three  fuzzy  rules,  and  a  total  of 
four  term  sets.  Figure  7.7  shows  the  membership  functions  for  feature  variables  NTP  and 
2HSI.  Table  7.14  shows  the  three  fuzzy  rules.  The  recognition  rate  of  the  classifier 
system  is  75.8%  correct  on  400  training  patterns,  78.2%  correct  on  368  testing  patterns 
and  77.0%  on  the  total  data  set. 
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Table  7. 14  -  VISIT  classification  rules  for  Pima  data 


NTP 

PGC  DBP  TSFT 

2HSI  BMI  DPF  AGE 

Class 

1 

1 

1 

1 

2 

1 

2 

2 

3 

2 

2 

2 

Aboyi  et  al.  [85]  also  utilized  the  Pima  dataset  to  test  their  algorithm.  In  their  5- 
fold  cross  validation  experiments,  the  average  fuzzy  system  had  11.2  fiizzy  rules,  40 
fuzzy  sets  and  an  accuracy  rate  of  73.05%.  These  results  indicate  that  our  algorithm 
performs  well  even  when  there  is  significant  class  overlap. 


7.6  Discussion  and  Conclusion 

This  chapter  presents  an  automatic  method  to  design  fuzzy  systems  for 
classification  via  evolutionary  optimization.  Different  fuzzy  systems  are  defined  on  the 
same  data  set  via  the  VISIT  algorithm  by  randomly  assigning  different  initial  parameters 
W,  A,  E,  and  M.  The  evolutionary  algorithm  finds  the  optimal  fuzzy  system  through 
simultaneously  optimizing  the  parameters.  Without  any  a  priori  loiowledge  of  the 
distribution  of  the  training  data,  the  proposed  approach  can  automatically  develop  a 
parsimonious  fiizzy  system  fi’om  scratch.  Simulation  results  performed  on  the  wine  data, 
iris  data,  Wisconsin  breast  cancer  data,  and  Pima  Indian  diabetes  data  illustrate  the  power 
of  the  proposed  approach. 

The  proposed  algorithm  is  simple,  and  it  does  not  require  any  techniques  such  as 
various  clustering  algorithms  to  provide  the  initial  models  as  the  basis  for  further  tuning. 
Many  other  methods  in  Tables  7.5,  7.9  and  7.11  do  need  initial  structure  learning  to 
determine  the  initial  stmcture  of  the  fiizzy  system,  including  the  rules,  the  shape  of 
membership  functions,  the  number  of  rules  and  the  number  of  membership  functions 
associated  with  each  feature.  For  example,  Wang  and  Lee  [53]  applied  a  MAC  algorithm 
to  find  the  initial  structure  and  use  a  fast  recursive  linear/nonlinear  least-squares 
algorithm  to  tune  parameters.  Setnes  et  al.  [51]  applied  a  fuzzy  clustering  technique  to 
develop  the  structure  of  the  fiizzy  model,  and  then  tuned  the  structure  and  parameters  via 
a  real-code  GA  combined  with  similarity-driven  simplification.  Shi  et  al.  [55]  utilized 
LVQ  to  detect  the  structure  of  the  fuzzy  system,  then  tuned  rules  and  membership 
functions  by  an  integer-coded  GA. 

For  the  evolutionary  algorithm  used  in  our  method,  a  fiizzy  eiqiert  system  acts  as 
the  fitness  function  to  evaluate  individuals.  The  value  of  the  MCE  criterion  and  the 
number  of  rules  are  the  inputs  to  the  fiizzy  eiqiert  system.  Therefiire,  the  evolutionary 
algorithm  can  investigate  the  accuracy  and  size  of  the  rule  base  of  the  fiizzy  system 
simultaneously,  and  leads  to  an  accurate  fiizzy  system  with  a  compact  rule  base. 
Moreover,  we  can  shift  the  surface  of  the  fitness  function  according  to  our  preference  or 
expert  knowledge  by  adjusting  the  constant  consequents  in  the  fuzzy  expert  system  rules 
and  its  membership  functions. 
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8  Fuzzy  PD+I  Learning  Control  for  a  Pneumatic  Muscle 


8.1  Introduction 

Fu22y  control  [60]-[63]  has  been  used  in  commercial  and  industrial  applications  in 
recent  years.  The  fuz2y  model  reference  learning  control  (FMRLC)  algorithm  developed 
from  the  general  ideas  in  linguistic  self-organizing  control  and  conventional  model 
reference  adaptive  control  [64].  FMRLC  has  a  learning  ability  that  enhances  the 
performance  over  time,  so  that  it  can  improve  the  closed-loop  system  by  using  measured 
data  from  the  plant’s  output  to  update  the  controller.  The  FMRLC  technique  has  been 
used  in  the  control  of  the  cargo  ship  steering  [65]  and  it  has  also  been  employed  to 
improve  the  performance  of  the  anti-skid  braking  system  in  various  road  conditions  [66]. 

In  this  chapter,  we  utilize  ideas  from  FMRLC  to  design  a  learning  controller  that 
results  in  length  tracking  for  a  pneumatic  muscle.  The  controller  is  an  adjustable  PD  type 
with  a  nonfiizzy  parallel  integral  branch.  The  controller  consequent  singleton  locations 
are  dynamically  adjusted  by  a  fuzzy  inverse  model.  The  inverse  model  has  for  one  of  its 
inputs  the  interior  pressure  of  the  PM.  This  is  effective  in  this  case  because  the  PM 
behaves  differently  when  inflated  as  compared  to  when  deflated. 


8.2  PM  Experimental  Apparatus 

The  laboratory  equipment  used  in  connection  with  the  ejqreriments  reported  in  this 
chapter  includes  the  PM  itself,  the  supporting  frame,  the  inflation/deflation  apparatus,  the 
interfrcing  electronics,  and  the  computer  hardware  and  software  used  to  implement  the 
controller. 

The  pneumatic  muscle  is  constructed  of  a  rubber  irmer  bladder  made  from  a  section 
of  7/8  inch  diameter  bicycle  tire  tubing  enclosed  in  a  plastic  sheath  material.  The  bicycle 
tubing  is  size  27-S-143,  and  the  plastic  mesh  shell  is  1.25  inch  plastic  sheathing  used  in 
electrical  coaxial  cable.  For  the  e:q>eriments  reported  in  this  chapter,  the  PM  assembly  is 
hung  vertically  from  the  supporting  structure.  The  upper  end  of  the  PM  is  hose-clamped 
to  an  adapter,  which  is  attached  to  a  source  of  con^ressed  gas.  The  lower  end  is  attached 
to  a  mass  that  is  freely  suspended.  The  upper  end  is  also  connected  to  air  supply  and 
drain  proportional  valves  that  regulate  the  amormt  of  gas  in  the  PM  at  any  time.  The 
valves  used  are  two-way  proportional  valves  that  respond  to  voltage  signals  (Festo  Corp. 
model  MPPE-3-1/8-10-0101B) 

The  supporting  stricture  consists  of  an  aluminum  cage  surrounded  by  plexiglass. 
This  structure  supports  and  surrounds  the  PM  and  the  mass.  The  inflation/deflation 
apparatus  consists  of  an  outside  source  of  compressed  gas  connected  to  the  PM  by  an 
input  gas  line. 

The  computer  hardware  for  the  results  in  this  chapter  consists  of  a  200MHz  PC 
with  16Mb  memory  interfacing  with  a  general  purpose  ISA  bus  I/O  card  with  digital  I/O 
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and  ADCs,  along  with  a  custom  electronic  interface  control  box  with  connectors.  This 
box  can  select  sampling  rates  of  1/64  second  to  1/14400  second.  Sensors  available  on  the 
system  include  a  pressure  transducer  for  measuring  PM  interior  pressure,  a  force 
transducer  measuring  the  force  applied  by  the  PM,  and  a  linear  distance  transducer  used 
to  measure  the  PM  length. 

The  control  programs  are  implemented  using  Borland  Turbo  C/C-H-  with  a 
sampling  time  of  1/64  second.  A  sampling  time  of  1/128  second  also  works,  but 
performance  is  essentially  identical  to  1/64  second. 


8.3  Fuzzy  Learning  Control 

Fuzzy  control  offers  several  advantages  over  conventional  control  techniques.  An 
important  advantage  for  problems  such  as  tracking  control  of  PMs  is  that  the  plant  model 
is  not  required  to  be  in  any  particular  form,  or  even  that  the  form  is  known.  Fuzzy 
control  is  rather  based  on  plant  performance.  An  extension  of  fiizzy  control  is  the  fozzy 
learning  control  methodology.  In  this  method,  the  fiizzy  controller  is  not  fixed,  but  rather 
it  is  dynamically  adjusted  by  a  fuzzy  inverse  model.  The  inverse  model  incorporates 
expert  knowledge  about  how  the  plant  should  behave  and  how  the  controller  should  be 
adjusted  if  the  plant  does  not  behave  as  desired.  The  inverse  model  dynamically  adjusts 
the  locations  of  the  consequent  centers  of  the  fuzzy  controller  according  to  the  measured 
behavior  of  the  plant. 

Our  fuzzy  learning  controller  is  shown  in  Fig.8.1.  It  has  two  main  parts:  the  fuzzy 
controller  and  the  learning  mechanism,  which  includes  the  fuzzy  inverse  model  and 
knowledge-base  modifier.  We  explain  each  of  these  below. 


Figure  8.1-  Fuzzy  learning  PD  + 1  tracking  controlla- 
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Fii77v  PD+I  Controller 

The  controller  is  of  PID  type  with  a  fuzzy  PD  part  and  a  nonfuzzy  integral  branch. 
The  fuzzy  part  has  inputs  e{kT)  =  r(kT)-y{kT)  and  c{kT)  =  ie{kT)-e{{k-\yr))IT 
where  r(JcT)  is  the  reference  signal  to  be  tracked  and  T  is  the  sampling  interval.  The 
controller  output  is  the  commanded  PM  pressure  p{kT) .  The  integral  Iwanch  of  the 
controller  is  nonfuzzy  because  we  have  no  expert  knowledge  about  how  to  adjust 
consequent  centers  for  this  branch  and  fuzziness  is  not  needed  for  any  other  reason  (e.g. 
making  the  integral  path  nonlinear). 

For  the  fuzzy  part  of  the  controller,  we  define  1 1  fiizzy  sets  on  the  e  universe  of 
discourse  (v4'(e), /  =  1,...,11)  and  11  fuzzy  sets  on  the  c  universe  of  discourse 

(y4'(c),/  =  l,...,ll).  These  fuzzy  sets  are  characterized  by  11  equally-spaced 

symmetrical  triangular  membership  functions  centered  at  [-1,  -.8,  -.6,  -A,  -2,  0,  .2,  .4,  .6, 
.8, 1]  shown  in  Figure  8.2. 


Figure  8.2  -  e,  c  input  fiizzy  sets  for  controller  and  inverse  model 
(linguistic-numeric  values  above  memberships) 


The  choice  of  scaling  gains  g^,  and  generally  come  from  the  experience  of 

the  designer.  As  a  first  guess,  the  gain  g^  is  chosen  so  that  the  range  of  e(kT)  will  lie  in 
the  interval  [-1,  +1],  i.e.  so  that  the  values  of  e(kT)  do  not  result  in  the  saturation  of  the 
outermost  membership  functions.  The  gain  g^  is  chosen  in  the  same  way,  so  that  the 
value  of  will  not  saturate  in  its  outermost  membership  fimctions.  We  choose  g^  to 

insure  that  signals  input  to  the  plant  from  the  controller  will  not  exceed  the  allowed  range 
of  the  input  to  the  plant. 

The  knowledge-base  in  the  fuzzy  controller  has  two  parts,  the  rule-base  and  the 
fuzzy  sets.  The  rule-base  for  the  fuzzy  controller  contains  the  IF-THEN  control  rules  of 
the  form: 


Rule  /:  If  e  is  and  c  is  C*  then  p  is 


(8.3.1) 


where  e  and  c  denote  the  linguistic  variables  associated  with  the  fuzzy  controller  inputs 
eikT^  and  c{\(F') ,  p  denotes  the  linguistic  variable  associated  with  the  fiizzy  controller 

output  p{)<T) ,  W  and  C*  denote  the  y'*  and  linguistic  value  associated  with  e 

and  c  ,  and  denotes  the  f'  linguistic  value  associated  with  p . 

The  controller  rule  base  consists  of  11x11  =  121  rules,  each  rule  with  a  separate 
consequent.  Therefore  we  will  have  121  singleton  output  memberships  wiA  the 
consequent  membership  for  rule  /  at  b.,i  =  \,...,\2\.  The  input  membership  functions 

are  fixed  and  are  not  tuned.  The  locations  of  the  output  singletons  are  assumed  to  be 
unknown,  and  will  be  tuned  by  the  fiizzy  inverse  model. 

The  degree  to  which  a  rule  is  "on"  at  time  kT  is  calculated  using  the  min  T-norm. 
This  gives  rise  to  a  premise  membership  function  for  Rule  /  (8.3.1)  calculated  as 


Mpremiseii)  ie{kT)\M^k  icikT)))  (8.3.2) 

e  c 

for  /■=!,...,  121.  Finally  we  use  the  center-average  defuzzification  technique  for  the 
inference  mechanism.  Therefore,  the  input  to  the  PM,  which  is  pressure,  is  calculated 
through  the  defuzzification  process  as 


p{kT)  = 


(8.3.3) 


where  the  summation  is  taken  over  all  121  rules  and  b.{kT)  denotes  the  center  of  the 

membership  function  as  a  consequence  of  the  rule  at  time  kT.  The  centers 
&j.(itr),/  =  l, ...,121  are  dynamically  adjusted  by  the  fiizzy  inverse  model  aijd  the 

learning  mechanism. 

The  integral  branch  is  a  standard  nonfiizzy  integral  function  of  the  tracking  error. 

A  constant  multiple  of  the  integral  of  the  error  is  added  to  the  fiizzy  PD  output,  making 
the  controller  essentially  a  PID  controller  in  which  the  P  and  D  parts  are  fiizzy  and 
adjustable,  while  the  I  part  is  nonfiizzy  and  fixed.  The  addition  of  the  integral  of  the 
tracking  error  serves  to  reduce  steady-state  error.  The  integral  gain  is  chosen  so  that 

steady-state  error  is  small.  If  g,  is  too  large,  however,  the  tracking  performance  becomes  < 
sluggish. 

Incidentally,  a  controller  similar  to  this  was  tried  also,  in  which  the  integral  part  as 
well  as  the  P  and  D  part  was  fuzzy,  making  a  true  fiizzy  PID  learning  controller. 
However,  not  much  success  resulted  from  this  controller  due  to  the  fact  that  tuning  was 
much  more  difficult  with  the  integral  branch  of  the  controller  implemented  as  a  fuzzy 
system.  Therefore,  the  I  branch  has  been  left  fixed  and  nonadjustable  in  this  research. 
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T  .earning  Mechanism 

The  learning  mechanism  consists  of  a  fiKzy  inverse  model  and  knowledge-base 
modifier.  The  fuzzy  inverse  model  is  a  fuzzy  system  with  inputs  g^e(kT),  g^c{kT) ,  and 
PM  internal  pressure  p{kT) .  The  inverse  model  output  Ab(kT)  determines  the  amount 
that  the  fuzzy  controller  output  memberships  are  changed  at  each  time  step.  We  choose 
pressure  to  be  the  third  inverse  model  input  since  pressxire  has  a  direct  effect  on  PM 
characteristics  (2.3).  Simulations  indicate  that  controller  P  and  D  gains  should  be  larger 
at  low  PM  pressures  than  at  high  PM  pressures.  This  is  due  to  the  fact  that  it  requires 
more  pressure  input  to  effect  a  given  amount  of  shortening  when  the  PM  is  deflated  than 
when  it  is  inflated 

The  e  and  c  input  xmiverses  for  the  inverse  model  each  have  11  fiizzy  sets 
characterized  by  11  equally-spaced  symmetrical  triangular  membership  functions 
identical  to  those  of  the  controller  (see  Figme  8.2).  The  third  input  p  to  the  inverse  model 
has  a  universe  of  discourse  consisting  of  1 1  fiizzy  sets  characterized  by  1 1  equally-spaced 
symmetrical  triangular  membership  functions  centered  at  [0,  .1,  .2,  .3,  A,  .5,  .6,  .7,  .8,  .9, 
1],  shown  in  Figure  8.3.  Therefore,  the  inverse  model  has  11x11x11  =  1331  rules. 


Figure  8.3  -  Fuz2y  sets  for  p  input  to  inverse  model 
(linguistic-numeric  values  above  memberships) 

The  output  fuzzy  sets  for  the  inverse  model  are  17  singletons  spaced  as  [-1,  -.8,  -.6, 
-.5,  -.4,  -.3,  -.2,  -.1,  0,  .1,  .2,  .3,  .4,  .5,  .6,  .8,  1]  in  units  of  pressure  (see  Figure  8.4).  The 
spacing  is  asymmetrical  to  give  more  change  in  pressure  when  the  PM  is  running  at  low 
pressures  (i.e.  inflated  a  small  amount,  which  corresponds  to  longer  lengths)  and  less  at 
high  pressures  (more  inflation,  shorter  lengths).  Tlie  output  hh{kT)  of  the  inverse  model 

is  calculated  similarly  to  the  output  of  the  fuzzy  controller  of  the  previous  section,  i.e. 
minimum  T-norm  and  center  average  defuzzification. 
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Figure  8.4  -  Output  singletm  fuz2y  sets  for  inverse  model 
(linguistic-numoic  values  above  membersliips) 


Two  representative  parts  of  the  inverse  model  rule  base  are  shown  in  Figures  8.5a 
and  8.5b.  Note  that  the  tables  list  the  linguistic-numeric  values  for  the  17  output 
membership  functions  of  the  inverse  model  that  are  defined  on  the  universe  of  discourse 
between  the  interval  [-1,  +1].  For  example,  when  the  error  and  derivative  error  are  small 
and  positive,  and  the  pressure  is  low,  we  will  find  the  output  membership  ftmction  center 
is  at  +0.8  (see  highlighted  box  in  Figure  8.5a).  This  means  a  relatively  large  positive 
increment  should  be  added  on  the  process  input  p(kT)  to  insure  the  y(kT)  will  not 
continue  to  decrease.  On  the  other  hand,  for  the  same  values  in  error  and  derivative  error, 
but  the  pressure  is  now  high,  the  center  is  at  +0.4  (see  highlighted  box  in  Figure  8.5b), 
which  means  we  only  need  half  as  much  adjustment  when  the  PM  is  running  in  high 
pressure. 
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Figure  8.5a  -  One  page  of  inverse  model  rule  base  for  low  PM  internal  pressure 
{p  linguistic-numeric  values  0-4) 
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Figure  8.5b  -  One  page  of  inverse  model  rule  base  for  high  PM  internal  pressure 
(p  linguistic-numeric  values  5-10) 


The  knowledge-base  modifier  performs  the  modification  of  the  PD  part  of  the 
fuzzy  controller  to  improve  tracking  performance.  The  knowledge-base  modifier 
changes  the  knowledge-base  of  the  fuzzy  controller  by  adjusting  the  oiitput  centers  of  the 
rules  that  are  "on”  at  time  kT  for  the  PD  part  Of  the  controller  according  to  the  update  law: 

b.(kT)  =  b.((k-\)T)  +  /!ib(kT)  (8.3.4) 

Other  update  laws  include  (i)  tempering  the  amoimt  of  update  Ab(A:7’)  by  the 
degree  to  which  that  particular  rule  is  “on”  at  a  given  time,  and  (ii)  updating  the 
consequents  for  rules  that  were  “on”  n  time  steps  in  the  past,  where  n  is  the  estimated 
time  delay  of  the  plant.  Neither  of  these  schemes  provided  any  improvement  over  the 
update  law  in  (8.3.4). 

Note  that  at  each  time  step  only  a  few  output  membership  function  centers  are 
changed  or  updated,  and  the  others  are  left  unchanged.  In  every  single  time  step,  the 
centers  changed  only  correspond  to  the  rules  that  are  “on.”  Therefore  no  more  than  4 
centers  are  changed  for  a  single  time  step  in  our  learning  controller.  This  is  because  only  " 
two  triangular  membership  functions  can  be  enabled  in  each  universe  of  discourse  in  one 
time  step  (i.e.,  2  inputs  will  only  turn  on  a  maximum  of  2^  =  4  rules  at  any  one  time). 

Actually,  the  controller  learns  a  new  situation  and  also  remembers  it  for  the  next 
time  this  situation  is  encountered.  Hence,  the  next  time  we  get  similar  values  for  the 
error,  the  controller  can  handle  it  better.  This  is  the  reason  we  call  it  “learning,"  rather 
than  adaptive  control. 
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8.4  Tracking  Results 

In  this  section  we  compare  simulation  and  actual  experimental  results  for  tracking 
control  of  the  pneumatic  muscle.  The  simulation  is  carried  out  to  conform  as  closely  as 
possible  to  laboratory  conditions  at  the  PM  test  station  in  the  Human  Effectiveness  lab  at 
Wright-Patterson  Air  Force  Base  in  Ohio. 

Simulation  Results 

We  can  write  equation  (2.3.1)  in  state  variable  form  as  follows: 

X,  =  ^2 

1  (  X  (8.4.1) 

^2  +P-Mg) 

where  Xj  =  3^ ,  Xj  =  y ,  g  =  9.8  m/sec^,  and  M  =  29.9  kg.  These  equations  are  simulated 

using  a  4*-order  Rimge-Kutta  algorithm  with  a  step  size  of  0.001  seconds. 

The  fuzzy  PD+I  learning  controller  is  simulated  with  a  sampling  interval  of  1/64 
seconds.  The  controller  gains  are  chosen  to  be  =  1 ,  g,.  =  0.1 ,  g^  =  0.01  and 

g  — 100 .  The  inverse  model  gains  are  chosen  to  be  =  .01  and  g^^  =  0.02 . 

The  initial  position  of  the  mass  is  0  inch,  corresponding  to  the  PM  folly  deflated 
and  extended.  The  command  is  a  combination  sinusoid/triangle  signal.  The  tracking 
performance  of  the  simulation  is  shown  in  Figure  8.6. 
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Figure  8.6  -  Simulation  tracking  performance 


Experimental  Results 

The  PM  was  loaded  with  a  mass  of  29.9  kg.  The  fu2zy  learning  controller  was 
implemented  with  a  sampling  time  of  1/64  seconds.  By  extensive  tuning,  the  best 
controller  gains  were  determined  to  be  =1,  g.  =0.1,  g^-  0.01  and  g^  =  100 .  The 

inverse  model  gains  are  chosen  to  be  =  .01  and  =0.02.  Thus  the  controller 

exactly  conforms  to  the  controller  of  the  simulation. 

The  initial  position  of  the  mass  is  0  inch,  corresponding  to  the  PM  fully  deflated 
and  extended.  The  command  is  the  same  combination  sinusoid/triangle  signal  used  in  the 
simulation.  The  experimental  tracking  perfomiMce  is  shown  in  Fig.  8.7. 
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Figure  8.7  -  Experimental  tracking  performance 


8.5  Discussion 

Our  simulation  and  ejqperimental  results  are  fairly  close,  lending  credibility  to  the 
model  derived  in  [7].  The  tracking  performance  in  the  simulation  can  be  improved  over 
that  of  the  experimental  results  by  increasing  the  scaling  gains.  However,  these  gains  do 
not  provide  acceptable  tracking  when  applied  to  the  actual  PM  in  the  laboratory,  and  so 
are  not  presented  here. 

Experimental  performance  is  inevitably  different  from  simulation  performance  due 
to  nonideal  conditions  present  in  the  laboratory  which  are  not  accoimted  for  in  the 
simulation.  Nonideal  conditions  in  the  lab  include  the  fact  that  the  supporting  structure 
for  the  PM  assembly  is  not  absolutely  rigid,  giving  rise  to  vibrations  in  the  PM  motion 
(see  Figure  8.7  especially  at  deflation),  the  fact  that  PM  length  and  pressure 
measurements  are  not  exact,  and  the  PM  valves  are  not  ideal. 

It  can  be  observed  from  Figures  8.6  and  8.7  that  initial  learning  occurs  during  the 
first  1  -  2  seconds.  Afrer  this,  tracking  is  fairly  accurate.  Tracking  performance  can  be 
ejq)ected  to  improve  over  time  as  more  different  situations  (i.e.  combinations  of  e,  c,  and 
p)  are  encountered  and  learning  becomes  more  complete. 
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8.6  Conclusions 

The  problem  addressed  in  this  chapter  is  position  tracking  control  for  a  pneiunatic 
muscle  actuator.  Since  the  coeificients  in  the  PM  are  poorly  known  and  vary  with  time, 
we  use  a  fuzzy  PID-type  tracking  controller  with  learning  ability.  The  controller  consists 
of  an  adjustable  PD  ftuzzy  part  and  a  parallel  nonfiizzy  integral  branch.  A  comparison  of 
simulation  and  experimental  results  shows  general  agreement  between  the  two,  lending 
credibility  to  a  recently-derived  PM  model. 


122 


9 


Fuzzy  Control  for  Pneumatic  Muscle  Tracking  via 
Evolutionary  Tuning 


9.1  Introduction 

In  general,  fuzzy  controllers  have  many  parameters  to  be  determined.  The  tuning 
of  these  parameters  is  typically  done  by  a  time-consuming  trial  and  error  process. 
Therefore,  automatic  tuning  of  design  parameters  is  advantageous.  Some  studies  ([67]- 
[69])  on  evolutionary  computation  have  shown  that  evolutionary  algorithms  (EAs) 
provide  powerful  ways  to  efficiently  search  in  poorly  understood,  irregular  spaces 
inq)ired  by  the  mechanism  of  natural  evolution.  Following  their  successful  applications 
in  many  difficult  optimization  problems,  EAs  are  becoming  increasingly  used  in  the 
design  of  fiizzy  systems.  Work  in  EA-based  design  of  fuzzy  systems  can  be  divided  into 
two  categories:  rule-base  construction  and  rule-base  tuning.  For  rule-base  construction, 
EAs  are  used  for  finding  rules  ([55],  [70])  and  determining  the  number  of  rules  [71]  fi:om 
a  data  set.  For  rule-base  tuning,  EAs  are  used  for  optimizing  membership  functions  in 
the  fuzzy  system  according  to  some  performance  criterion  [72]. 

In  this  chapter,  we  design  a  fhzzy  P+ID  controller  for  an  actual  pneumatic  muscle 
system  located  in  the  Human  Sensory  Feedback  Laboratory  at  Wright-Patterson  Air 
Force  Base  in  Dayton,  Ohio.  For  this  purpose,  a  recurrent  neuro-fuzzy  model  of  the  PM 
is  utilized,  and  on  the  basis  of  this  model  a  modified  evolutionary  algorithm  is  used  to 
tune  the  controller  parameters.  The  resulting  controller  is  then  applied  to  the  actual  PM 
with  excellent  tracking  results. 


9.2  Evolutionary  Fuz^  P+BD  Control 

Fuzzy  controllers  are  gaining  popularity  in  the  control  community  due  to  their 
capabilities  of  dealing  with  uncertainty,  but  their  systematic  design  is  still  an  open 
problem.  A  typical  fuzzy  controller  foimd  in  the  literatxire  often  has  many  parameters, 
some  of  which  are  intercorrelated  and  co-dependent.  It  is  often  impossible  to  perceive 
the  relationship  between  closed-loop  performance  and  the  parameters  of  the  fiizzy 
controller.  This  increases  the  difficulty  of  design  of  fiizzy  controllers.  For  these  reasons, 
we  propose  an  evolutionary  fuz:^  P+ID  controller  for  PM  control. 

FiizTv  P+ID  Controller 

A  fiizzy  P+ID  controller,  shown  in  Figure  9.1,  is  constructed  using  an  incremental 
fuzzy  logic  controller  with  output  Axj.(k)  =  FLC{e,e)  in  place  of  the  proportional  term 

in  a  conventional  PID  controller.  The  PM  input  is  a  voltage  x{k)  which  is  expressed  in 
incremental  form  as 
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Ax:(^)  =  x{k)  -  x{k  - 1) 


=  KpS^hXfik)  +  KjTeik)  -  Kj^ 


y(k)-2y(k-\)  +  y{k-2) 
T 


(9.2.1) 


In  (9.2.1),  Kp,  /T/,  and  Kj^  are  PID  control  gains,  is  the  output  scaling  gain 
of  the  fuzzy-P  subsystem,  and  T  is  the  step  size.  In  general,  scaling  gains  and  in 
Figure  9.1  are  chosen  so  that  values  of  S^e  and  lie  in  the  interval  [-1,  1],  and 
is  often  chosen  to  be  equal  to  the  reciprocal  of  In  our  approach,  these  scaling  gains 

will  be  determined  by  an  evolutionary  algorithm. 

To  reduce  the  number  of  control  parameters,  we  design  membership  fimctions  for 
the  fuzzy-P  controller  with  the  following  assmnptions: 

1.  e{k)  and  e(k)  universes  of  discourse  have  five  triangular  membership  functions,  i.e. 
NB,  NS,  ZO,  PS  and  PB.  These  attain  their  maximum  value  of  unity  at  -  fi,  -  a ,  0, 
« ,  and  yff  respectively  and  form  a  partition  of  unity.  This  number  of  memberships  is 
chosen  to  keep  the  evolutionary  optimization  task  relatively  simple  while  maintaining 
good  control  performance. 

2.  Axy(k)  universe  has  five  singleton  membership  functions  located  at  -;ff(NB), 
-or  (NS),  0(ZO),  a  (PS)  and  >9(PB). 

A  general  rule  in  the  fuzzy-P  part  of  the  P+ID  controller  can  be  written  as: 
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RULEi{i  =  \,  2,... ,25): 

If  e{k)  is  {e{k))  and  e{k)  is  (e(k))  then  Ak^  (k  +  l)  =  C‘ 

where  juj  and  ju^g  are  one  of  the  fiizzy  sets  NB,  NS,  ZO,  PS  and  PB.  The  consequent 

C*  takes  on  values  in  the  set  {-  /J,-a,0,a,  fi}.  The  rule  base  of  the  fuz2y-P  part  of  the 
controller  is  determined  from  expert  knowledge  and  shown  in  Table  9.1.  The  crisp 
output  of  the  fiizzy-P  system  then  can  be  calculated  as: 

Ec'a- 

Axf  (k  +  l)  =  -  (9.2.2) 

/ 

where  /  =  1,...25  and  ju.  is  the  premise  value  of  rule  i. 


Table  9.1  -  Rule  base  for  fiiazy  P+ID  controller 


. . . . .  - _ e _ 

NB 

NS  _ 

ZO 

PS 

PB 

e 

PB 

ZO 

PS 

PB 

PB 

PB 

PS 

NS 

ZO 

PS 

PB 

PB 

ZO 

NB 

NS 

ZO 

PS 

PB 

NS 

NB 

NB 

NS 

ZO 

PS 

NB 

NB 

NB 

NB 

NS 

ZO 

There  are  three  kinds  of  parameters  in  the  fii2zy  P+ID  controller:  fiizzy  control 
parameters,  PID  parameters,  and  scaling  gains  for  the  fuzzy-P  subsystem.  The  quantities 
a  and  P  are  fiizzy  control  parameters.  The  proper  tuning  of  a  and  p  will  greatly 

improve  performance.  In  general,  a  is  related  to  steady-state  accuracy  and  p  is  related 
to  the  response  speed.  Small  a  implies  high  steady-state  accuracy;  large  P  speeds  up 
the  transient  response  [73].  The  PID  parameters  Kp,  Kj,  and  determine  the  control 
resolution  and  the  stability  of  the  controlled  system.  According  to  [73],  the  proven 
sufiRcient  conditions  show  that  the  system  stability  is  not  destroyed  when  a  fiizzy  P+ID 
controller  takes  the  place  of  a  conventional  PID  controller.  This  means  that  closed-loop 
stability  is  guaranteed  regardless  of  the  choice  of  a  and  p.  The  quantities  and 

are  scaling  gains  for  the  fiizzy-P  subsystem.  Control  parameters  and  scaling  gains 

will  be  determined  by  an  evolutionary  algorithm.  The  PID  parameters  will  be  determined 
using  the  Ziegler-Nichols  tuning  method. 
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Fvnliitinnarv  Design  of  Fii7zv  P+ID  Controller 

The  design  of  fiizzy  controllers  is  complicated  by  the  fact  that  we  do  not  know  the 
relationship  between  membership  functions  and  the  control  performance.  To  avoid  a 
tedious  trial  and  error  tuning  process,  the  following  evolutionary  algorithm  is  proposed  to 
automatically  tune  the  five  parameters  cc ,  0 ,  S^,  and  of  the  fiizzy  P  +ID 

controller. 


Rvolution  Algorithm: 

1 .  Initialize  randomly  an  even  number  of  P  individuals. 

2.  Evaluate  the  fitness  of  the  P  individuals  using  the  fitness  function  /  defined  in 
(9.2.3)  below. 

3.  Select  the  P/2  fittest  individuals  and  copy  them  as  the  first  P/2  individuals  of  the 
next  generation.  Discard  the  other  individuals. 

4.  Apply  the  mutation  operator  defined  in  (9.2.4)  below  to  the  individuals  selected  in 
3  and  generate  another  P/2  individuals  for  the  next  generation. 

5.  Go  to  Step  2  and  repeat  until  the  stopping  criterion  is  met. 


A  real  vector  p(«)  =  [a{ri),  p{ri),  S^{n),  S^(n)]  €  is  used  in  the  above 

algorithm  to  represent  the  nth  individual  in  the  population,  which  can  be  evolved  by  a  set 
of  mutation  operations  and  the  elitist  selection  operation.  In  each  generation,  the 
individuals  with  fitness  in  the  top  50%  are  used  to  create  the  population  for  the  next 
generation.  For  control  applications,  the  well-known  ITAE  criterion  is  often  used  to  form 
the  fitness  function: 


K 

Z 

k=\ 


e(k) 


r(k) 


\kT 


(9.2.3) 


where  K  is  the  total  number  of  time  steps  in  the  calculation,  e(k)  is  the  tracking  error  at 
the  jfeth  time  instant,  r(k)  is  the  desired  output  at  the  Ath  time  instant,  and  T  is  the  step 
size.  Note  that  the  reference  signal  r{k)  is  always  greater  than  zero.  Thus,  the  larger  the 

value  of  the  fitness  function,  the  better  the  tracking  performance. 

We  define  the  following  mutation  operator  for  creation  of  new  individuals  from  the 
fittest  of  the  previous  generation: 


P J.m*l  («  +  %)  =  Pj^  (")  +  <^jPj 


f 

J  n 


rm 


f 

\  •'max  j 


(9.2.4) 
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where  y  =  l,...,5  indexes  parameters  in  individuals,  m  is  the  generation  number, 
n  =  “‘iividuals  with  the  P/2  largest  fitnesses  in  the  current 

generation,  P  is  the  population  size,  (Tj  is  a  learning  coefficient  which  determines  the 
amount  of  change  of  parameter  j  fi-om  generation  to  generation,  pj  is  a  Gaussian  random 

number  e  [-1,  1] ,  is  the  fitness  of  individual  n  in  the  /nth  generation,  and  is 
the  largest  fitness  in  the  /nth  generation. 

In  consideration  of  the  constraints  0  <  a  <  <  1 ,  the  following  techniques  are  used 

to  deal  with  illegal  individuals  possibly  generated  by  the  mutation  operators: 

1.  If  a«+i(«+%)<0,  a«+i(w +%)  =  /?«(«),  where  is  a  random  number 
uniformly  distributed  €  [O,  l]. 

2.  If  (n+^)>l,  («  +  %)  =  Pp  («)  ’  where  is  a  random  number 

uniformly  distributed  e  [O,  l] . 

3.  If  a;„+i(«+%)>y^m+i («  +  %)»  exchange  the  values  of  a«+i(/2+%)  and 

Pm+\  (P  ' 


9.3  Experimental  Results  and  Discussion 

In  this  section,  we  design  a  fuz2y  P+ID  controller  for  a  PM  hanging  vertically 
actuating  a  mass  as  shown  in  Figme  2.2.  The  system  is  located  in  the  Human  Sensory 
Feedback  Laboratory  at  Wright-Patterson  Air  Force  Base.  The  design  procedure  for  the 
controller  is  as  follows.  First,  a  recurrent  neuro-fuzzy  model  of  the  PM  is  derived  fi'om 
experimental  data  taken  fi-om  the  physical  system  using  the  VISIT  and  BP  algorithms  as 
outlined  in  Chapter  6. 

This  RNFIS  model  is  then  used  in  place  of  the  actual  PM  system  in  the  tuning  of 
the  five  parameters  of  the  fuzzy  controller  {oc,  using  the  evolutionary 

method  discussed  in  Section  9.2  and  also  to  determine  the  PID  gains  Kp,  Kj,  and  K ^ . 
The  PID  gains  are  found  via  the  well-known  Ziegler-Nichols  tuning  method  applied  to 
the  identified  recurrent  neuro-fuzzy  system.  We  discuss  each  of  these  below. 

Fvnintinnarv  Design  of  Fii7:7v  P+TD  Controller  Using  Neuro-fiizzy  Model 

An  optimal  fiizzy  P+ID  controller  is  designed  for  the  pneumatic  muscle  via  the 
evolutionary  algorithm  of  the  previous  section  based  on  the  RNFIS  model  for  the  PM 
obtained  in  Chapter  6.  Note  that  parameters  a,  p,  S^,  S^e,  and  were  optimized,  so 
we  needn’t  specify  scaling  gains  in  advance  according  to  the  actual  ranges  of  e  and  e . 
PID  parameters  were  set  as  =-0.7,  Kj  =-0.22,  and  Kd  =-0.5  using  the  Ziegler- 
Nichols  tvming  method  based  on  open-loop  tests  with  the  identified  RNFIS  model. 
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After  400  generations  with  a  population  of  P  =  120,  the  following  optimal 
parameters  are  obtained:  a*  =0.203,  =0.632,  5e  =0.9982,  =02232,  and 

S*dx  =  1.002 .  Corresponding  membership  functions  of  the  fuzzy  controller  are  plotted  in 
Figure  9.2,  and  its  control  surface  is  plotted  in  Figure  9.3. 


We  tested  the  controller’s  ability  to  force  the  PM  to  track  a  reference  signal.  For 
these  experiments  the  pneumatic  muscle  was  loaded  with  a  mass  of  20  kg.  The  initial 
length  of  the  muscle  is  5  cm,  corresponding  to  the  PM  fully  deflated  and  extended.  For 
comparison,  the  P+ID  controller  performance  is  compared  to  that  of  the  fuzzy  model 
reference  learning  controller  (FMRLC)  which  was  tuned  by  trial  and  error  for  best 
performance  in  Chapter  8.  The  tracking  performance  of  both  controllers  is  plotted  in 
Figure  9.4,  and  a  comparison  of  the  tracking  errors  is  shown  in  Figure  9.5.  The  reference 
signal  is  a  combination  sinusoid/triangle  function. 
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Figure  9.4  -  Tracking  performance  with  (a)  FMRLC  controller,  (b)  fu2zy  P+ID 

controller. 
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Figure  9.5  -  Comparison  of  tracking  errors 


From  Figure  9.5,  we  see  that  the  evolutionary  fuzzy  P+ID  controller  performance  is 
superior  to  that  of  the  FMRLC  i.e.  the  tracking  error  is  less.  To  compare  the  tracking 
performances  rigorously,  we  use  the  maximal  deviation,  defined  as 

MD  =  rtm\r{1c)-y{kfj  (9.3.1) 

k 


and  the  average  deviation,  defined  as 


AD  =  — 
^1 


r 


'Zm-yik} 


(9.3.2) 


where  it  ranges  over  all  time  samples,  r{k)  is  the  reference  signal,  y{k)  is  the  PM  length, 


and  K1  is  the  total  number  of  time  steps.  For  the  FMRLC,  the  maximal  deviation  is  MD 
=  0.41  and  the  average  deviation  is  AD  =  0.1173.  For  the  evolutionary  fiizzy  P+ID 
controller,  we  have  MD  =  0.16  and  AD  =  0.0558,  demonstrating  the  superiority  of  the 
evolutionary  ftizzy  P+ID  controller. 
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The  control  actions  exerted  by  the  two  controllers  in  the  above  tracking 
performances  are  plotted  in  Figure  9.6.  The  control  input  from  the  evolutionary  fuzzy 
P+ID  controller  is  seen  to  be  less  than  that  from  the  FMRLC.  Therefore,  the  evolutionary 
fuzzy  P+ID  controller  attains  better  tracking  performance  using  smaller  control  effort 
than  the  FMRLC.  Because  the  control  signal  in  the  PM  system  corresponds  to  the  volts 
supplied  to  the  valve,  this  means  that  the  fuzzy  P+ID  controller  is  more  efficient,  i.e.  can 
use  less  energy  and  get  better  results.  This  can  be  valuable  in  applications  where  energy 
is  limited,  such  as  use  in  remote  locations,  imdfcrwater  or  spacecraft  applications. 


Figure  9.6  -  Comparison  of  control  actions 

It  should  be  noted  that  the  FMRLC  used  for  these  experiments  was  tuned  over 
many  trials  to  yield  good  tracking.  To  be  fair,  however,  these  FMRLC  results  may  not  be 
the  best  attainable;  better  tracking  may  result  from  further  tuning.  In  contrast,  the 
excellent  tracking  results  shown  in  Figure  9.4(b)  for  the  evolutionary  fuzzy  P+ID 
controller  were  obtained  upon  first  application  of  the  controller,  i.e.  no  tuning  was 
required.  Most  fuzzy  systems  require  extensive  tuning  before  they  perform  satisfactorily. 
Therefore,  the  combination  of  neuro-fuzzy  modeling  and  evolutionary  design  based  on 
this  model  is  a  powerful  design  tool  for  fuzzy  control. 
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10  Fuzzy  Model  Predictive  Control  for  a  Planar  Arm 
Actuated  by  Four  Pneumatic  Muscle  Groups 


10.1  Introduction 

Model  predictive  control  (MPC)  is  based  on  the  optimization  of  a  finite-time 
fimctional  of  the  plant  tracking  error  and  the  control  effort.  It  was  originally  developed 
for  process  control  applications,  which  are  typically  highly  nonlinear  with  slow  dynamics. 
This  is  because  of  the  typically  high  computational  burden  of  MPC.  With  the  advent  of 
fester  computation  capabilities,  MPC  has  found  applications  in  control  of  plants  with 
faster  dynamics  as  well.  MPC  is  especially  advantageous  for  the  control  of  highly 
nonlinear  systems,  such  as  robotic  systems  actuated  by  pneumatic  muscles  ([74]-[81]). 

In  this  chapter,  MPC  is  implemented  by  discretizing  a  continuous-time  model  of 
the  arm  actuated  by  PMs,  then  formulating  an  81 -rule  Takagi-Sugeno  model  based  on 
linearization  of  the  arm  model  at  81  points  in  the  state  space.  The  controller  is  derived  by 
finding  the  input  sequence  that  minimizes  a  finite-time  horizon  of  the  tracking  error. 
Simulations  showing  tracking  performance  of  the  arm  demonstrate  the  effectiveness  of  the 
method.  Also,  we  compare  the  performance  of  the  MPC  to  that  of  the  sliding  mode 
controller  of  Chapter  5. 


10.2  Takagi-Sugeno  Model  of  Planar  Arm  Actuated  by  Four  Groups  of  PMs 

For  development  of  the  fuzzy  MPC,  we  use  the  model  of  the  planar  arm  developed 
in  Chapter  5  (i.e.  5.3.13  -  5.3.15).  Define  the  state  vector  for  the  arm 
x=[xi  X2  xs  x^f  =  [^1  $2  di  02]^.  Then  fi-om  (5.3.13),  the  arm  model  can  be 
written  as 


X  =  g{x)  +  m{x)u 


(10.2.1) 


where 


m{x)  = 


(10.2.2b) 


02x2 

G(x) 


u  = 


^Ps 

Ape 


Also  define  an  output  y  consisting  of  the  two  joint  angles,  i.e. 


y  =  Cx 


where 


C  = 


10  0  0 
0  10  0 


Linearizing  the  model  (10.2.1)  about  an  operating  point  x*  gives 

X  —  AiX  +  BiU  +  Di 


where 


■  0 

0 

1 

0  ■ 

0 

0 

0 

1 

dai 

da\ 

da\ 

da\ 

dxi 

dx2 

dX2 

dxi 

da2 

da2 

da2 

da2 

_  dx\ 

dX2 

dxz 

dxi  ^ 

x=x^ 

r 

0 

0 

1 

Bi  = 


0 

Gii(x) 


0 

Gi2{x) 


and 


(10.2.2c) 


(10.2.3) 


(10.2.4) 


(10.2.5) 


(10.2.6a) 


(10.2.6b) 
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(10.2.6c)  • 


0 

0 

“■  Aiz\Xi  —  Ai^2^2  ~~  -^i33^3  “  -^134^4 


^2  ■^i41^1  -^i42^2  -^143^3  ■^i44^4  J  3^= 


This  linearized  model  is  valid  in  the  vicinity  of  the  point  x\ 

Defining  a  sample  time  T  and  approximating  the  derivative  as 


x{t)  =  x{kT) 


xi(k  +  1)T)  -  x{kT) 
T 


(10.2.7) 


where  A;  is  an  integer,  the  linear  continuous-time  model  (10.2.5)  can  be  approximated  in 
the  vicinity  of  x*  by  the  linear  discrete-time  system: 

x{k  +  1)  =  (/  -b  TAi)x(k)  +  TBiu{k)  +  TDi  (10.2.8) 

Now  define  three  fuzzy  sets  on  xi's  and  X2's  universes  of  discourse  characterized 
by  the  membership  functions  in  Figures  10.1a  and  10.1b.  Similarly,  define  three  fuzzy  sets 
on  xa's  and  X4's  universes  of  discourse  characterized  by  the  membership  functions  in 
Figure  10.1c.  Consider  all  possible  combinations  of  the  fiizzy  set  centers,  defining 
3“^  =  81  points  x\  i  =  1,  ...,  81  in  the  state  space  about  which  to  linearize  (10.2.1). 
Discretizing  these  81  linearized  systems  yields  an  81 -rule  discrete-time  TS  fuzzy  model  of 
the  arm.  A  rule  in  this  model  is 

Rule  i  (i  =  1, . . . ,  81)  .* 

If Xi (k)  is  ni  and  X2ik)  is  and  X3(fc)  is  and  X4^{k)  is  fx\  then 
x{k  +  1)  =  (7  -I-  TAi)x(k)  +  TBiu{k)  -b  TDi,  y{k)  =  Cx{k). 
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Note  that  the  linearization  is  performed  about  the  points  listed  in  Table  10.1: 

Table  10.1  -  Linearization  points  for  the  81  rules  in  rule  base 
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T 

The  resulting  global  TS  fuzzy  model  is  given  by 


81  81  81 
x{k  +  1)  =  +  TAi)x{k)  4-  ^WiTBiu{k)  +  'y'wjTDj 


i=l 


i=l 


i=l 


where  Wi(x)  is  the  degree  of  match  of  rule  i  for  an  input  x,  calculated  as 


Ij,\{x)f4ix)f4(x)fj,\(x) 


Ui  .... 

(^)/^2  i^)f4 

j=i 


Therefore,  the  global  TS  fuzzy  model  is  given  by 

x{k  +  1)  =  Ax{k)  +  Bu{k)  +  D 


(10.2.9) 


(10.2.10) 


(10.2.11) 
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where 


81 

A  =  J2wi(I  +  TAi)  (10.2.12a) 

i=\ 


81 

(10.2.12b) 

2=] 


81 

D  =  YjjJiTDi  (10.2.12c) 

i=l 

In  the  next  section,  the  matrices  A,  B,  and  D  will  be  labeled  A{k),  B{k),  and  D{k) 
because  they  change  with  every  time  step  according  to  Wi,  which  depends  on  the  values  of 
the  state  x. 

It  should  be  noted  that  positive  values  of  imply  the  shoulder  tricep  is  inflating 
and  the  shoulder  bicep  is  deflating,  while  negative  values  of  x^  imply  the  opposite. 
Similarly,  positive  values  of  X4  imply  the  elbow  bicep  is  inflating  and  the  elbow  tricep  is 
deflating,  while  negative  values  of  X4  imply  the  opposite.  This  determines  which  values  of 
Bq  and  Bi  (2.3.2)  to  use  when  performing  the  linearization  in  (10.2.6). 

The  matrices  for  the  81 -rule  TS  fuzzy  model  of  the  arm  are  listed  in  Appendix  A  of 
this  report.  In  practice,  a  TS  fuzzy  model  could  be  derived  from  experiments  on  the  actual 
arm,  rather  than  by  linearizing  a  mathematical  model. 


10.3  Model  Predictive  Control  of  Planar  Arm  Actuated  by  Four  Groups  of  PMs 

Define  the  values  of  matrices  A,  B,  and  D  (10.2.12)  at  tune  step  A:  to  be  A(fc), 
B(k),  and  D(k).  Then  the  TS  fuzzy  model  of  the  arm  actuated  by  PMs  is 

x{k  +  1)  =  A{k)x{k)  +  B{k)u{k)  +  D{k)  (10.3.1a) 
y{k)  =  Cxik)  (10.3.1b) 

Since  CB(k)  =  02x2  VA;,  from  (10.3.1),  we  have 


y(k  +  2)  =  CA(k  -b  l)A(A;)a;(A:)  +  CAik  -f-  l)B{k)u{k)  -b  CA{k  +  l)D{k)  (10.3.2) 
where  we  have  used  the  fact  that  CB  =  CD  —  0. 
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Let  r{k)  be  a  vector  of  desired  reference  input  trajectories  and  consider  the 
performance  index 


J  =  [y(k  +  2)  -  r{k  +  2)fP[y{k  -f-  2)  -  r{k  +  2)]  +  u^(k)Qu{k)  (10.3.3) 


where  P  and  Q  are  symmetric  positive  definite  weighting  matrices.  Using  (10.3.2)  and 
setting  dJ/du{k)  =  0  yields  the  optimal  control 

u*(k)  =  [B'^{k)A'^{k  +  l)CfPCA{k  +  l)B{k)  +  Q]"'  • 

•  B^(k)A'^ik  +  l)Cf^P[r(k  +  2)  -  CA{k  +  l)A(k)x(k)  -  CA{k  +  1)L>(A:)]  (10.3.4) 


Since  A{k  +  1)  is  not  yet  known,  we  initially  use  A{k  +  1)  =  A{k)  in  (10.3.4)  and  use 
this  to  calculate  an  interim  value  for  u{k)  via  (10.3.4).  Now  this  interim  value  of  u{k)  is 
used  to  calculate  a  predicted  value  for  x{k  4-  1)  on  the  basis  of  the  TS  fuzzy  model 
(10.3.1).  From  this  predicted  x{k  +  1),  a  predicted  value  for  A{k  +  1)  can  be  obtained  as 
in  (10.2.10),  (10.2.12).  This  predicted  value  for  A{k  +  1)  along  with  the  known  A{k), 
B{k),  and  D{k)  and  the  true  plant  state  x{k)  are  then  used  again  in  (10.3.4)  to  calculate  a 
final  optimal  control  at  time  k,  i.e.  u*{k).  This  process  is  repeated  at  every  time  step. 


10.4  Simulation  Results 

The  planar  arm  of  Figure  5.3  with  opposing-pair  PM  groups  actuating  the  shoulder 
and  elbow  joints  is  simulated  using  a  4th-order  Runge-Kutta  algorithm  with  a  step  size  of 
0.01  seconds.  Let  1^=1^=  0.46  m,/ci  =  lc2  =  0-23  m,  mi  =  m2  =  10  kg,  =  7.62 
cm,  and  =  5.08  cm,  =  6  and  =  3.  Thus  the  conditions  are  identical  to  those  of 
the  Chapter  5  simulations.  For  these  simulations  we  assume  all  physical  quantities  of  the 
arm,  i.e.  masses,  lengths,  etc.  are  exactly  known,  but  that  the  PM  coefficients,  i.e.  F,  K, 
and  B  are  not  known  with  precision.  Assume  all  12  shoulder  PMs  (6  pairs)  are  matched 
to  each  other,  but  not  to  the  elbow  PMs.  Similarly  assume  all  6  elbow  PMs  (3  pairs)  are 
matched  to  each  other,  but  not  to  the  shoulder  PMs. 

The  model  predictive  controller  is  designed  according  to  (10.3.4)  using  ideal 
values  for  all  F,  K,  and  B  coefficients  (see  Section  2.3).  To  investigate  robustness  of  the 
model  predictive  controller  and  compare  it  to  the  sliding  mode  controller,  we  randomly 
choose  three  sets  of  actual  (nonideal)  F,  K,  and  B  coefficients  from  a  uniform 
distribution  within  ±50%  of  their  nominal  values.  The  coefficients  used  are  listed  in  Table 
5.1.  Note  that,  for  comparison  purposes,  we  use  the  same  coefficients  for  the  MPC  that 
were  used  for  the  sliding  mode  controller  of  Chapter  5. 

For  all  simulations,  we  use  P  —  S  x  10®l2x2»  Q  =  10“^/2x2j  and  T  =  0.01  sec. 
Also  let  Pots  =  Pote  =  310.3  kPa  (45  psi).  Pots  =  490.4  kPa  (71.1  psi)  and  Po6e  =  310.5 
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kPa  (45  psi).  These  noiminal  pressures  insure  that  if  control  is  lost,  shoudler  and  elbow 
angles  will  revert  to  the  equilibrium  positions  ^2eq  =  7r/2  (see  Chapter  5). 

With  these  nominal  pressures  and  with  ng  as  above,  all  PM  pressures  remain  in  the 
allowable  range  206.844  <P<  620.532  kPa  (30  <  P  <  90  psi)  for  all  control  tasks  in 
this  section. 

We  investigate  controller  performance  in  tracking  a  sinusoidal  spline  in  x-y  space 
(5.6.2).  The  spatial  tracking  performance  for  the  true  plant  with  PM  coefficients  in  set  1  is 
shown  in  Figure  10.2.  Tracking  performance  when  PMs  are  described  by  coefficient  sets 
2  and  3  are  similar  to  Figure  10.2  and  are  not  shown.  It  will  be  noted  that  there  is  some 
spatial  tracking  error,  which  is  to  be  expected  due  to  the  parameter  errors.  For 
comparison  purposes,  tracking  performance  for  the  identical  plant  using  sliding  mode 
control  (Chapter  5)  is  shown  in  Figure  5.4. 

The  MPC  elbow  angle  tracking  error  (which  is  larger  than  the  shoulder  angle 
tracking  error  in  this  case)  for  coefficient  sets  1,  2,  and  3  is  shovra  in  Figure  10.3.  For 
comparison  purposes,  the  elbow  tracking  errors  for  the  same  three  coefficient  sets  under 
sliding  mode  control  from  Chapter  5  are  shown  in  Figure  5.6. 

Figure  10.4  shows  the  control  efforts  Ap^,  Apg  that  produced  the  tracking 
performance  in  Figure  10.2.  It  will  be  noted  that  with  the  nominal  pressures  Pots,  Pohs, 
Pote,  and  Pobe  given  above,  the  PM  input  pressures  remain  vwthin  the  allowable  range  for 
these  PMs.  For  comparison  purposes,  the  control  efforts  from  sliding  mode  control  of 
the  identical  plant  are  shown  in  Figure  5.5. 
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Tracking  error  (rad) 


Figure  10.3  -  MFC  elbow  angle  tracking  errors  for  3  different  plants  (PM  coefficient  sets 
1,  2,  and  3),  sinusoidal  spline  reference  trajectory,  m\  =  m2  =  10  kg. 
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Figure  10.4  -  MFC  control  effort  producing  tracking  performance  of  Figure  10.2. 


Sinusoidal  spline,  doubled  mass 

In  practical  applications,  it  may  be  expected  that  the  mass  actuated  by  the  arm  will 
change.  To  investigate  the  robustness  of  the  sliding  controller  to  changing  masses,  we 
increased  the  arm  masses  mi,  m2  each  by  a  factor  of  2  to  20  kg  and  used  the  same  model 
predictive  controller  as  above  to  track  the  sinusoidal  spline  reference  trajectory.  Both  the 
shoulder  and  elbow  angle  tracking  errors  are  similar  to  those  in  Figure  10.3,  indicating 
that  the  fuz2y  model  predictive  controller  is  robust  to  changes  in  mass. 

Figure  10.5  shows  the  control  effort  produced  by  the  fuz2y  model  predictive 
controller  for  this  plant.  Naturally,  the  control  effort  is  larger  due  to  the  increased  arm 
masses.  It  is  seen  that  with  the  above  nominal  pressures,  the  PM  pressures  remain  within 
the  allowable  range  for  the  duration  of  the  control  process.  In  Figure  5.13,  for 
comparison  purposes,  we  show  the  corresponding  control  efforts  resulting  from  sliding 
mode  control  of  the  identical  plant. 
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Figure  10.5  -  MFC  control  effort,  sinusoidal  spline  reference  trajectory,  coefficients  in  set 

1,  mi  =  m2  =  20  kg. 


10.5  Discussion  and  Conclusions 

A  fuzzy  model  predictive  controller  has  been  designed  for  a  2  DOF  planar  arm 
assembly  with  highly  nonlinear  pneumatic  muscle  actuators  in  opposing  pair  configurations 
actuating  the  shoulder  and  elbow  joints.  The  control  input  for  the  planar  assembly  enters 
the  process  through  nonlinear  spring  and  friction  coefficients  and  a  nonlinear  contractile 
force  term  that  are  contained  within  a  mathematical  model  for  the  pneumatic  muscle 
actuators.  A  dynamic  model  for  the  arm  with  four  groups  of  PM  actuators  is  derived. 
This  model  is  linearized  about  81  operating  points  in  the  state  space  to  produce  an  81 -rule 
TS  fiizzy  model  of  the  arm.  A  two-input  model  predictive  controller  is  designed  on  the 
basis  of  this  TS  fuzzy  model. 

Simulations  of  closed-loop  tracking  were  performed  with  a  sinusoidal  spline  spatial 
reference  path  desired  for  the  end  effector.  Closed-loop  tracking  performance,  resulting 
from  simulations,  is  seen  to  be  in  line  with  corresponding  results  obtained  with  sliding 
mode  contoller  for  the  identical  plant.  Closed-loop  tracking  with  several  sets  of  PM 
coefficients  within  a  ±50%  range  of  ideal  are  compared.  The  model  predictive  controller 
is  also  shown  to  be  robust  for  a  100%  change  in  arm  masses. 
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It  should  be  noted  that  P  and  Q  chosen  for  our  MPC  were  not  the  best  possible. 
They  were  chosen  to  give  similar  results  to  those  of  Chapter  5.  The  MPC  tracking 
performance  can  always  be  improved  by  adjusting  P  and  Q  appropriately.  Similar 
statements  could  be  made  about  the  sliding  mode  controller.  Sliding  mode  control  has 
several  user-chosen  parameters  that  can  profoundly  affect  performance.  For  this  reason,  it 
is  not  meaningful  to  attempt  to  determine  which  of  the  two  controllers  is  "best." 

However,  the  following  can  be  stated  from  a  comparison  of  sliding  mode  and 

MPC: 

1.  From  a  comparison  of  Figures  5.4  and  10.3,  we  see  more  oscillatory  arm  movement 
with  the  fuzzy  MPC  designed  above  than  with  sliding  mode  controller  of  Chapter  5, 
especially  in  the  initial  stages.  This  can  be  a  disadvantage  for  flexible  systems  where 
unmodeled  higher  frequency  modes  can  be  excited  by  vibrations. 

2.  The  sliding  mode  controller  produces  greater  initial  tracking  errors  but  smoother 
overall  arm  movement,  while  fuzzy  MPC  produces  smaller  initial  errors  but  greater  errors 
in  the  middle  and  end  of  the  task.. 

3.  From  Figures  5.6,  10.4,  5.5,  and  10.5,  we  see  greater  initial  control  effort  (i.e.  pressure 
input  to  the  PMs)  with  sliding  mode  control  than  with  fuzzy  MPC.  This  can  be  a 
significant  factor,  since  input  pressure  is  always  limited  by  the  supply  pressure,  which  is 
typically  less  than  1000  kPa. 

4.  After  initial  transients,  sliding  mode  control  produces  a  much  smoother  pressure 
command  than  does  the  fuzzy  MPC  designed  above.  This  can  be  a  significant 
consideration,  since  pressure  variations  necessitate  excessive  control  valve  action,  which 
can  shorten  the  life  of  the  valve. 

5.  Fuzzy  model  predictive  control  is  a  simple  linear  control  law.  It  is  much  easier  to 
implement  than  sliding  mode,  which  is  nonlinear,  time-varying,  and  has  boundary  layers 
across  which  the  control  law  changes. 

6.  Tracking  performance  of  both  controllers  can  be  made  as  good  as  desired  by  adjusting 
the  user-defined  parameters  appropriately.  The  price  paid  for  this  greater  accuracy  is 
usually  greater  control  effort. 

7.  In  practice,  the  TS  fuzzy  model  could  be  obtained  from  experiments  on  the  actual  arm, 
rather  than  by  linerizing  a  mathematical  model. 
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11  Conclusions 


This  research  has  consisted  of  applying  nonlinear  control  techniques  to  the  difficult 
problem  of  controlling  systems  actuated  by  pneumatic  muscle  actuators  (PMs).  PMs  are 
lightweight,  cheap,  powerful  actuators  that  are  particularly  suitable  for  robot  actuation 
and  for  use  in  exoskeletons  because  of  their  similarity  to  human  muscle.  Their  advantage 
lies  in  the  fact  that  in  robotic  and  exoskeletal  systems,  the  actuators  must  be  moved  along 
with  the  system  itself,  as  is  the  case  for  a  robot  whose  actuators  are  mounted  on  the  links, 
or  an  exoskeleton  whose  actuators  are  mounted  on  the  wearer’s  limbs.  If  the  actuators 
are  heavy  (as  in  the  case  of  electric  motors),  this  can  significantly  subtract  from  the 
system’s  payload.  PMs  have  the  highest  power/weight  and  power/volume  ratios  of  any 
actuator. 

PM  control  is  complicated  by  the  fact  that  they  are  highly  nonlinear  and  time- 
varying.  Their  nonlinearity  arises  because  of  their  construction,  and  their  time-varying 
nature  is  a  result  of  heat  generation  due  to  friction,  which  is  a  natural  consequence  of  the 
PM’s  repeated  inflation  and  deflation.  The  control  philosophies  chosen  for  this  research 
are  adaptive  control,  sliding  mode  control,  model  predictive  control,  and  neuro-fiizzy 
control.  The  methods  were  chosen  because  of  their  ability  to  handle  these  conditions. 

Some  of  our  algorithms  were  implemented  on  the  Pneumatic  Muscle  Test  Station 
in  the  Human  Effectiveness  Laboratory  (HECP))  at  Wright  Patterson  Air  Force  Base, 
which  is  under  the  direction  of  Dr.  D.  W.  Repperger.  This  enabled  us  to  test  these 
algorithms  on  actual  PMs,  which  gave  us  invaluable  experience  and  insight  into  their 
properties.  Other  algorithms  involved  too  many  PMs  to  test  in  HECP.  For  these  systems, 
we  relied  on  simulations  to  investigate  our  controllers.  We  took  great  pains  to  insure 
that,  to  the  greatest  extent  possible,  the  simulations  were  accurate  and  agreed  with  the 
behavior  of  actual  PM-actuated  systems. 

Our  results  fall  into  two  broad  categories:  (a)  position  control  of  a  single  PM 
hanging  vertically  actuating  a  mass,  and  (b)  control  of  planar  robotic  manipulators  via 
groups  of  PMs.  The  former  could  be  carried  out  on  an  actual  PM  in  the  lab,  while  the 
latter  was  done  in  simulation.  The  research  on  systems  containing  more  than  one  PM 
was  concentrated  on  planar  arm  configurations  because  we  wanted  to  specifically  address 
anthropomoiphic  systems,  as  would  be  encoxmtered  in  exoskeletons. 

The  control  methods  used  for  (a)  above  include  the  Fxizzy  Model  Reference 
Learning  Control  (FMRLC)  strategy  (Chapter  8),  and  the  Fuzzy  P+ID  strategy  (Chapter 
9).  In  addition,  a  neuro-fiizzy  model  of  the  PM  was  obtained  from  data  taken  in  the  lab 
(Chapter  6).  This  model  was  necessary  for  the  control  method  of  Chapter  9,  which 
utilizes  an  evolutionary  algorithm  in  conjunction  with  the  Zeigler-Nichols  tuning  method 
for  PID  compensators  to  arrive  at  a  suitable  controller.  Both  methods  controlled  the  PM 
well,  but  the  Fuzzy  P+ID  method  (Chapter  9)  was  superior  to  the  FMRLC  (Chapter  8)  in 
that  more  accurate  tracking  resulted  while  using  less  control  effort.  Control  effort  is 
always  a  concern  in  any  real-world  control  task.  However,  an  advantage  of  the  FMRLC 
is  that  it  was  not  necessary  to  take  data  prior  to  designing  the  controller  and  no  a  priori 
modeling  was  necessary,  as  was  necessary  for  the  Fuzzy  P+ID  method.  Instead,  the 
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FMRLC  required  extensive  tuning,  which  was  not  necessary  for  the  Fuzzy  P+ID 
controller 

The  control  methods  used  for  (b)  above,  i.e.  control  of  planar  robotic  manipulators 
via  groups  of  PMs,  include  adaptive  control,  single-  and  multi-input  sliding  control,  and 
multi-input  Fuzzy  Model  Predictive  Control  (MPC).  In  our  opinion,  the  sliding  mode 
approach  (Chapters  4  and  5)  is  particularly  advantageous  for  PM  control  because  it  can 
handle  any  degree  of  nonlinearity  and  it  is  specifically  designed  assuming  modeling 
errors.  These  errors  are  always  present  when  dealing  with  PMs,  since  PMs  are  very 
difficult  to  model  accurately.  In  addition,  bounds  on  state  tracking  errors  are  available 
with  the  sliding  mode  approach.  For  planar  arm  robotic  systems  or  exoskeleton  systems, 
these  state  errors  can  be  translated  into  guaranteed  spatial  tracking  accuracies. 

The  adaptive  approach  (Chapter  3),  which  is  derived  from  the  sliding  mode 
approach,  has  the  advantage  of  being  capable  of  adapting  to  slowly-varying  dynamics. 
The  dynamics  of  systems  containing  PMs  do  vary  with  time,  due  to  heat  generated  by 
friction.  Therefore,  this  capability  of  adaptive  control  is  valuable.  However,  the  adaptive 
approach  requires  a  somewhat  more  accurate  model  of  the  system  dynamics.  This  is  a 
disadvantage  of  the  adaptive  approach,  since  the  dynamics  of  systems  containing  PMs  are 
always  poorly  known. 

Model  Predictive  Control  (MPC)  has  been  used  for  years  in  the  control  of  industrial 
processes.  These  processes  are  highly  nonlinear,  MIMO,  and  contain  time  delays, 
therefore  are  very  difficult  to  control.  MPC  has  traditionally  suffered  from  high 
computational  load,  and  has  therefore  been  relegated  to  processes  with  slow  dynamics. 
This  situation  has  changed  in  recent  years  with  the  advent  of  faster,  more  compact 
computers  with  more  memory,  hence  MPC  is  considered  in  this  research  for  PM  control. 

As  with  adaptive  and  sliding  mode  control,  MPC  requires  some  kind  of  model  of 
the  process.  An  advantage  of  MPC  is  that  this  model  can  be  of  any  kind.  In  this 
research,  we  have  modeled  the  planar  arm  actuated  by  four  groups  of  PMs  with  an  81- 
rule  Takagi-Sugeno  fuzzy  system.  This  model  was  derived  from  a  4-state  nonlinear  DE 
model  of  the  arm.  If  desired,  the  fuzzy  model  could  be  determined  experimentally, 
obviating  the  need  for  a  DE  model.  This  is  a  further  advantage  of  fuzzy  MPC  -  no  DE 
model  of  the  process  is  necessary,  and  a  fuzzy  model  of  the  nonlinear  system  can  be 
determined  experimentally.  Also,  MPC  has  well-known  robustness  properties,  which  are 
useful  for  systems  involving  PMs.  The  only  control  method  considered  in  this  research 
that  required  no  model  of  the  process  was  FMRLC  (Chapter  8). 

An  additional  outcome  of  this  research  is  the  Evolutionary  Variable  Input  Spread 
Inference  Training  (EVISIT)  algorithm.  EVISIT  is  a  novel  method  for  constructing  a 
frizzy  system  describing  a  process  on  the  basis  of  data  taken  from  the  process.  While  not 
absolutely  necessary  for  PM  control,  EVISIT  can  be  used  for  obtaining  a  fuzzy  model  of 
the  PM,  and  this  fuzzy  model  can  then  be  used  for  control.  EVISIT  is  also  quite  useful 
for  pattern  classification  problems  (Chapter  7). 

In  conclusion,  we  have  investigated  several  methods  for  controlling  systems 
containing  PMs.  The  methods  include  adaptive,  sliding  mode,  model  predictive,  and 
neuro-fuzzy  control.  Each  method  has  its  advantages  and  disadvantages.  The  methods 
are  immediately  useful  for  and  applicable  to  autonomous  robotic  systems  actuated  by 
PMs  because  these  systems  are  relatively  straightforward  control  problems  with  no 
human  interaction.  Exoskeletal  systems  will  require  more  investigation,  since  they  must 
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interact  in  real-time  with  the  wearer.  Thus  the  exoskeleton  problem  is  more  challenging 
and  interesting.  It  appears,  however,  that  the  control  methods  investigated  in  this 
research  may  be  useful  for  control  of  exoskeletons  actuated  by  PMs  as  well.  Further 
research  should  concentrate  on  exoskeleton  actuation  taking  into  account  wearer 
interaction. 
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Appendix  A 

Matrices  for  TS  Fuzzy  Model  of  Planar  Arm 

The  TS  fiizzy  model  of  the  planar  arm  is  an  81 -rule  fuzzy  system  given  in  Section 
10.2.  The  matrices  (7  -I-  TAi),  Bi,  and  7)j,  i  =  I,  ...  81  are  given  by 

1.0  0  0.01  0 

0  1.0  0  0.01 

£31  €32  €33  €34 

€41  €42  643  644 

0  0 
0  0 
’?32 
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Then  the  matrices  (/  +  TAi),  Bi,  Di  are  given  by  (A.l  -  A.6)  with 


«!  = 

■  -  1.45e  -  1 
+  4.41e-3 

+  1.18e  +  3 
-  2.76e  +  3 

+  l.OOe  +  0 
+  4.51e-4 

+  1.03e-4' 
+  l.OOe +  0 

0!2  = 

■  -  1.45e  -  1 
+  4.41e-3 

+  3.956  +  2 
-  L58e  +  3 

+  l.OOe  +  0 
+  4.51e-4 

+  1.03e-4' 
+  l.OOe  +  0 

az  = 

■  -  1.45e  -  1 
+  4.41e-3 

“h  1.326  2 

—  1.186  +  3 

+  l.OOe +  0 
+  4.51e-4 

+  1.03e-4' 
+  l.OOe +  0 

0:4  = 

-  1.45e  -  1 
+  4.41e  -  3 

+  2.636  +  2 
—  3.956  +  2 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4' 
+  l.OOe +  0 

as  = 

-  1.45e  -  1 
+  4.41e-3 

+  2.936  -  2 
~  2.596  -  1 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4‘ 
+  l.OOe +  0 

ag  = 

-  1.45e  -  1 
+  4.41e  -  3 

“h  2.636  “h  2 
--  3.956  +  2 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4' 
+  l.OOe +  0 

a-i  = 

■  -  1.45e-l 
+  4.41e  -  3 

-|-  1.326  -|-  2 
—  1,186  +  3 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4' 
+  l.OOe +  0 

as  = 

;  -  1.45e  -  1 
+  4.41e  -  3 

*1“  3.956  “h  2 
-  1.586  +  3 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e  -  4‘ 
+  l.OOe +  0 

ag  = 

■  -  1.45e  -  1 
+  4.41e  -  3 

“h  1.186  “4'  3 
-  2.76e  +  3 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4' 
+  l.OOe +  0 

aio  = 

■  -  1.73e  -  1 
+  8.67e-2 

-  1.756  +  1 
H-  9.306  +  2 

-  1.51e  +  0 
+  7.54e  +  0 

-  3.35e  +  0 
+  2.68e  +  0 

an  = 

■  -  1.73e-  1 
+  8.67e  -  2 

—  2.286  “h  2 
+  2.466  +  2 

■{"  1.62e  —  1 
+  6.70e  +  0 

-  1.68e  +  0 
+  1.84e  +  0 

ai2  = 

■  -  1.73e  -  1 
+  8.67e  -  2 

—  2.986  H“  2 
+ 1.786  +  1 

+  1.84e  +  0 
+  5.86e  +  0 

+  3.42e  -  5 
+  l.OOe  +  0 

II 

CO 

■  -  1.73e  -  1 
+  8.67e  -  2 

+  7.026  +  1 
“h  2.286  “I"  2 

-  6.76e  -  1 
+  8.38e  -  1 

-  1.68e  +  0 
+  1.84e  +  0 

156 


au  = 

■  -  1.73e  -  1 
+  8.67e  -  2 

-  2.05e  -  2 
+  1.43e-  1 

+  1.006  +  0 
+  1.846  -  4 

+  3.426  -  5 
+  1.006  +  0 

II 

I— ( 

■  -1.73e-  1 
+  8.67e  -  2 

+  7.01e+  1 
2.286  -f-  2 

+  2.686  +  0 
-  8.386  -  1 

+  1.68e  +  0 
+  1.626  -  1 

«16  = 

■  -  1.73e  -  1 
+  8.67e  -  2 

—  2.986  4"  2 
+ 1.766  +  1 

+ 1.626  -  1 
—  5.866  +  0 

+  3.42e  -  5 
+  1.006  +  0 

an  = 

■  -  1.73e  -  1 
+  8.67e  -  2 

—  2.286  "h  2 
+  2.466  +  2 

+ 1.846  +  0 
-  6.706  +  0 

+  1.686  +  0 
+  1.626  -  1 

ai8  = 

■  -  1.73e  -  1 
+  8.67e  -  2 

-  1.766  +  1 
+  9.306  +  2 

+  3.516  +  0 
-  7.546  +  0 

+  3.35e  +  0 
-  6.766  -  1 

Q:i9  = 

■  -  1.89e-  1 
+  1.19e-  1 

-  1.41e  +  3 
+  2.826  +  2 

+ 1.006  +  0 
-  3.22e  -  4 

-  7.336  -  5 
+  1.006  +  0 

«20  = 

■  -  1.89e  -  1 
+  1.19e-  1 

"t"  2.826  *{■  2 
+  1.136  +  3 

+ 1.006  +  0 
-  3.22e  -  4 

-  7.336  -  5 
+  1.006  +  0 

0^21  = 

■  -  1.89e  -  1 
+  1.19e-  1 

+  8.466  +  2 
+ 1.416  +  3 

+ 1.006  +  0 
-  3.226  -  4 

-  7.336  -  5 
+  1.006  +  0 

tt22  = 

■  -  1.89e  -  1 
+  1.19e  -  1 

-  5.646  +  2 

—  2.826  +  2 

+ 1.006  +  0 
-  3.946  -  4 

-  7.33e  -  5 
+  1.006  +  0 

«23  = 

!  -  1.89e-  1 
_  +1.19e-  1 

+  1.82e  -  1 
+  3.026  -  1 

+ 1.006  +  0 
-  3.946  -  4 

-  7.33e  -  5 
+  1.006  +  0 

0^24  = 

■  -  1.89e  -  1 
+  1.19e-  1 

-  5.646  +  2 

—  2.826  +  2 

+ 1.006  +  0 
-  3.946  -  4 

-  7.336  -  5 
+  l.OOe  +  0 

Q!25  = 

■  -  L89e-  1 
+  1.19e-  1 

+  8.466  +  2 
+ 1.416  +  3 

+  l.OOe  +  0 
-  3.946  -  4 

-  .7.33e  -  5 
+  l.OOe  +  0 

<^26  = 

■  -  1.89e  -  1 
+  1.19e-  1 

+  2.826  +  2 
+  1.136  +  3 

+ 1.006  +  0 
-  3.946  -  4 

-  7.33e  -  5 
+  l.OOe  +  0 

(X27  = 

■  -1.89e-  1 
+  1.19e-  1 

-  1.416  +  3 
+  2.826  +  2 

+ 1.006  +  0 
-  3.946  -  4 

-  7.33e  -  5 
+  l.OOe  +  0 
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0:28  = 

■  -  2.94e  -  3 
+  4.41e-3 

+  1.18e  +  3 
-  2.76e  +  3 

-f"  1.006  +  0 
+  4.51e  -  4 

4-  1.03e  -  4 
4- 1.006  4-  0 

029  = 

■  -  2.94e  -  3 
+  4.41e  -  3 

"h  3.956  “h  2 
-  1.58e  +  3 

+  1.006  +  0 
+  4.516  -  4 

4-  1.036  -  4 
4- 1.006  4-  0 

0:30  = 

■  -  2.94e  -  3 
+  4.41e  -  3 

+  1.326  +  2 
—  1.186  3 

+  1.006  +  0 
+  4.516  -  4 

4- 1.036  -  4 
4-  1.006  -f  0 

0:31  = 

■  -  2.94e  -  3 
+  4.41e-3 

+  2.636  +  2 
-  3.956  +  2 

+  1.006  +  0 
+  5.526  -  4 

4- 1.036  -  4 
4-1.006  4-0 

0:32  = 

■  -  2.94e  -  3 
+  4.41e  —  3 

+  8.926  -  4 
-  3.576  -  3 

+ 1.006  +  0 
+  5.526  -  4 

4- 1.036  -  4 
4- 1.006  4-  0 

0:33  = 

■  -  2.94e  -  3 
+  4.41e-3 

+  2.636  “h  2 
—  3.956  4“  2 

+  1.006  +  0 
+  5.52e  -  4 

4-  1.036-4 
4- 1.006  4-  0 

0:34  = 

■  -  2.94e  -  3 
+  4.41e-3 

"h  1.326  “h  2 
—  1. 186  +  3 

+ 1.006  +  0 
4-  5.526  -  4 

4- 1.036  -  4 
4- 1.006  4-  0 

0:35  = 

■  -  2.94e  -  3 
+  4.41e-3 

“h  3.956  -|-  3 
-  1.58e  +  3 

4-  l.OOe  4-  0 
4-  5.526  -  4 

4-1.036-4 
4-  1.006  4-0 

0:36  = 

■  -  2.94e  -  3 
+  4.41e  -  3 

+ 1.186  +  3 
--  2.766  +  3 

4- 1.006  4-  0 
4-5.526  -4 

-h  1.036  -  4 
4- 1.006  4-  0 

0:37  = 

■  +  2.55e  -  2 
+  2.00e  -  1 

-  1.756  +  1 
+  9.306  “h  2 

-  1.516 -KO 
4-  7.546  4-  0 

—  3.356  4-  0 
4-  2.686  4-  0 

0:38  = 

■  +  2.55e  -  2 

4"  2.006  —  1 

—  2.286  +  2 
”1“  2.466  “f“  2 

4- 1.626  -  1 

4"  6.706  4-  0 

-  1.686  4-0 
4-1.846  4-0 

0:39  = 

■  +  2.55e  -  2 
+  2.00e  -  1 

—  2.986  +  2 
+ 1.786  +  1 

4“  1.846  4"  0 

4"  5.866  4“  0 

-h  3.426 -5 
4-  l.OOe  4-0 

0:40  = 

■  +  2.55e  -  2 
+  2.00e  -  1 

+  7.02e  +  1 
+  2.286  4“  2 

-  6.766  -  1 

4"  8.386  —  1 

-  1.68e  +  0 
4-  1.84e  4-  0 
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«41  = 

2.556  —  2 
+  2.00e  -  1 

“1“  2.506  —  2 
+  1.806-1 

-f-  1.006  ”l“  0 
+  1.84e  -  4 

+  3.426  -  5 
+  1.006  +  0 

«42  = 

■  +  2.55e  -  2 
+  2.00e  -  1 

+  7.026  +  1 
+  2,286  +  2 

+  2.68e  +  0 
-  8.38e  -  1 

+  1.686  +  0 
~t"  1.626  —  1 

tt43  = 

"1”  2.556  —  2 
+  2.006  -  1 

—  2,986  +  2 
+  1.706  +  1 

+  1.62e  -  1 
-  5.86e  +  0 

+  3.426  -  5 
+  1.006  +  0 

0:44  = 

'  +  2.556  -  2 
+  2.006  -  1 

—  2,286  +  2 
+  2.406  +  2 

+  1.84e  +  0 
-  6.70e  +  0 

+  1.686  +  0 
+ 1.626  -  1 

«45  = 

+  2.556  —  2 
_  +  2.006  -  1 

-  1.706  +  1 
+  9.306  +  2 

+  3.516  +  0 
-  7.546  +  0 

4“  3.356  “t"  0 
-  6.766  -  1 

tt46  = 

■  -  6.316  -  3 
-  3.156  -  3 

-  1.416  +  3 
+  2.826  +  2 

+ 1.006  +  0 
-  3.226  -  4 

-  7.336  -  5 
+  1.006  +  0 

Q!47  = 

■  -  6.316  -  3 
-3.156-3 

+  2.826  +  2 
+  1.136  +  3 

+ 1.006  +  0 
-  3.22e  -  4 

-7.336-5 
+  1.006  +  0 

0:48  = 

■  -  6.316  -  3 
-  3.156  -  3 

+  8.406  +  2 
+  1.41e  +  3 

+  1.006  +  0 
-  3.226  -  4 

-  7.336  -  5 
+  1.006  +  0 

0^49  = 

■  -  6.316  -  3 
-  3.156  -  3 

-  5.046  +  2 

—  2.826  +  2 

+ 1.006  +  0 
-  3.946  -  4 

-  7.33e  -  5 
+  1.006  +  0 

0:50  = 

■  -6.316-3 
-3.156-3 

-  0.376  -  4 

-  2.556  -  3 

+ 1.006  +  0 
-  3.946  -  4 

-  7.336  -  5 
+  1.006  +  0 

0:51  = 

■  -  6.316  -  3 
-  3.156  -  3 

-  5.046  +  2 

—  2.826  +  2 

+  1.006  +  0 
-  3.946  -  4 

-  7.336  -  5 
+  1.006  +  0 

0:52  = 

■  -  6.316  -  3 
-3.156-3 

+  8.406  +  2 
+ 1.416  +  3 

+ 1.006  +  0 
-  3.946  -  4 

-  7.336  -  5 
+ 1.006  +  0 

«53  = 

■  -  6.316  -  3 
-  3.156  -  3 

+  2.826  +  2 
+ 1.136  +  3 

+ 1.006  +  0 
-  3.946  -  4 

-  7.336  -  5 
+  1.006  +  0 

«54  = 

■  -  6.316  -  3 
-3.156-3 

-1.41e  +  3 
+  2.826  +  2 

+ 1.006  +  0 
-  3.946  -  4 

-  7.336-5 
+  1.006  +  0 
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^55  = 

■  +  1.39e  -  1 
+  4.41e-3 

+  1.18e  +  3 
-  2.76e  +  3 

+  l.OOe  +  0 
+  4.51e  -  4 

+  1.03e  -  4 
+  l.OOe  +  0 

0^56  = 

'  +  1.39e-  1 
+  4.41e-3 

+  3.95e  +  2 
-  1.58e  +  3 

+  l.OOe  +  0 
+  4.51e  -  4 

+  1.03e-4 
+  l.OOe  +  0 

0^57  = 

■  +  1.39e  -  1 
+  4.41e-3 

”t-  1.326  -|“  2 
-  1.186  +  3 

+  l.OOe  +  0 
+  4.51e  -  4 

+  1.03e  -  4 
+  l.OOe  +  0 

^58  = 

■  +  1.39e  -  1 
+  4.41e  -  3 

+  2.636  +  2 
—  3.956  +  2 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e  -  4 
+  l.OOe  +  0 

<^59  = 

■  +  1.39e  -  1 
+  4.41e  -  3 

—  2.756  —  2 
+  2.526  -  1 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e  -  4 
+  l.OOe  +  0 

tteo  = 

■  +  1.39e  -  1 
+  4.41e  -  3 

+  2.636  +  2 
—  3.956  +  2 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4 
+  l.OOe  +  0 

«61  = 

■  +  1.39e  -  1 
4-  4.41e  -  3 

+  1.326  +  2 
-  1.186  +  3 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e-4 
+  l.OOe +  0 

«62  = 

■  +  1.39e  -  1 
+  4.41e  -  3 

+  3.956  +  2 
—  1.586  +  3 

+  l.OOe  +  0 
+  5.52e  -  4. 

+  1.03e  -  4 
+  l.OOe  +  0 

0163  = 

■  +  1.39e  -  1 
+  4.41e  -  3 

+  1.186  +  3 
-  2.766  +  3 

+  l.OOe  +  0 
+  5.52e  -  4 

+  1.03e  -  4 
+  l.OOe  +  0 

0:64  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

~  1.746  + 1 
+  9.306  +  2 

-  1.51e  +  0 
+  7.54e  +  0 

—  3.35e  +  0 
-|-  2.68e  -|-  0 

065  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

—  2.286  +  2 
+  2.466  +  2 

-  1.62e  -  1 
+  6.70e  +  0 

-  1.68e  +  0 
+  1.84e  +  0 

066  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

—  2.986  +  2 
+  1.786  + 1 

+  1.84e  +  0 
+  5.86e  +  0 

+  3.42e  -  5 
+  l.OOe  +  0 

067  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

+  7.036  + 1 
+  2.286  +  2 

-  6.76e  -  1 
+  8.38e  -  1 

-  1.68e  +  0 
+  1.84e  +  0 
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O'es  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

+  1.06e  -  1 
+  1.27e  -  1 

+  1.006  +  0 
+  1.84e  -  4 

+  3.42e-5' 
+  l.OOe +  0 

0^69  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

+  7.03e  + 1 
■j"  2.286  +  2 

+  2.68e  +  0 
-  8.386  -  1 

+  1.68e  +  0 
+  1.62e-  1 

«70  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

—  2.986  "1"  2 
+  1.766  +  1 

+  1.62e  -  1 
—  5.86e  +  0 

+  3.42e-5' 
+  l.OOe +  0 

a?!  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

-  2.286  +  2 
+  2.466  +  2 

+  1.846  +  0 
-  6.706  +  0 

+  1.68e  +  0' 
+  1.62e-l 

0172  = 

■  +  1.67e  -  1 
-  8.37e  -  2 

-  1.75e  + 1 
+  9.306  +  2 

+  3.516  +  0 
-  7.54e  +  0 

+  3.35e  +  0 
-6.76e-  1 

«73  = 

■  +  1.76e  -  1 
-  1.25e  -  1 

-  1.416  +  3 
+  2.826  +  2 

+  1.006  +  0 
-  3.22e  -  4 

-7.33e-5' 
+  l.OOe +  0 

aj4  = 

■  +  1.76e  -  1 
-  1.25e  -  1 

+  2.82e  +  2 
+ 1.136  +  3 

+  l.OOe  +  0 
-  3.22e  -  4 

-7.33e-5‘ 
+  l.OOe  +  0 

tt75  = 

■  +1.76e-  1 
-  1.25e  -  1 

+  8.466  +  2 
+  1.416  +  3 

+  l.OOe  +  0 
-  3.22e  -  4 

-7.33e-5‘ 
+  l.OOe +  0 

«76  = 

'  +  1.76e  -1 
-  1.25e  -  1 

-  5.64e  +  2 

—  2.826  +  2 

+  l.OOe  +  0 
-  3.94e  -  4 

-7.33e-5' 
+  l.OOe +  0 

Q!77  = 

■  +  1.76e  -  1 
-  1.25e  -  1  - 

-  1.836  -  1 

-  3.07e  -  1 

+  l.OOe  +  0 
-  3.94e  -  4 

-  7.33e-5' 
+  l.OOe  +  0 

0178  = 

■  +  1.76e  -  1 
-  1.25e  -  1 

-  5.646  +  2 

—  2.826  +  2 

+  l.OOe  +  0 
-  3.94e  -  4 

-7.33e-5' 
+  l.OOe +  0 

<^79  = 

■  +  1.76e  -  1 
-  1.25e  -  1 

+  8.46e  +  2 
+  1.41e  +  3 

+  l.OOe  +  0 
-  3.94e  -  4 

-  7.33e-5' 
+  l.OOe  +  0 

«80  = 

■  +  1.76e  -  1 
-  1.25e  -  X 

+  2.826  +  2 
+ 1.136  +  3 

+  l.OOe  +  0 
-  3.94e  -  4 

-7.33e-5' 
+  l.OOe  +  0 

0!81  = 

+  1.76e  -  1 
-  1.25e  -  1 

-  1.41e  +  3 
+  2.826  +  2 

+  l.OOe  +  0 
-  3.94e  -  4 

-7.33e-5' 
+  l.OOe +  0 
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A  = 

■  -  3.03e  -  3 

-  1.66e-3 

+  4.54e-3 

"I"  6.626  —  3 

/?2  = 

-  3.03e  -  3 
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Abstract  -  A  fuzzy  learning  control  technique  is  used  for 
position  tracking  involving  the  vertical  movement  of  a  mass 
attached  to  a  pneumatic  muscle.  Because  the  pneumatic  muscle 
is  nonlinear  and  time  varying,  conventional  fixed  controllers  are 
less  effective  than  the  fuzzy  controller  proposed  in  this  paper. 
The  controller  is  of  a  PID  type,  with  an  adaptive  fuzzy  PD  part 
and  a  nonfuzzy  integral  branch.  A  novelty  of  the  controller  is 
that  the  fuzzy  inverse  model,  which  dynamically  adjusts  the  PD 
part  of  the  controller,  incorporates  the  internal  PM  pressure  as 
an  input.  Experimental  results  are  presented  from  the 
pneumatic  muscle  test  facility  in  the  Human  Effectiveness 
Laboratory  at  Wright-Patterson  Air  Force  Base. 

I.  INTRODUCTION 

A  pneumatic  muscle  (PM)  actuator  [1-3]  is  a  type  of 
artificial  muscle  that  has  been  used  in  robotic  applications. 
The  PM  construction  consists  of  an  ellipsoidal  rubber  tube 
inside  a  very  strong  braided  plastic  sheath  (Fig.  1).  When  the 
tube  is  inflated  due  to  an  increase  in  pressure,  it  widens  and 
the  muscle  shortens,  providing  force  to  move  a  mass  in  the 
direction  of  shortening. 

Compared  to  either  hydraulic  or  electric  motor  actuation 
systems,  the  PM  actuator  has  two  important  advantages  [1] 
which  are  extremely  high  power/weight  ratio  (1  W/g)  and 
power/volume  ratio  (1  W/cm^).  Another  advantage  of  the 
PM  is  that  it  is  a  “soft  actuator”  which,  if  it  fails  (i.e.  bursts), 
does  so  in  a  relatively  safe  manner.  Since  the  PM  consists  of 
a  small  gas  volume  enclosed  in  a  soft  material,  if  a  failure 
occurs,  the  risk  of  human  injury  is  minimal 

The  main  disadvantage  of  PMs  is  that  they  are  highly 
nonlinear  and  time  varying,  since  they  are  made  of  rubber, 
the  characteristics  of  which  vary  with  temperature.  Therefore 
it  is  difficult  to  control  them  precisely  in  a  position  or  force 
control  sense. 


This  research  was  supported  by  AFOSR  grant  #F49620-00- 1-0300, 
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Figure  1  -  Construction  of  pneumatic  muscle 
actuator 


Efforts  to  actuate  pneumatic  muscles  have  taken  several 
approaches.  In  [4],  a  sliding  mode  controller  is  used  for 
tracking.  Inoue  [5]  attempts  to  control  the  PM  using  a  PID- 
like  control  scheme,  Lilly  [6]  presents  proof  of  stability  and 
asymptotic  tracking  for  a  nonlinear  adaptive  controller  for  a 
joint  with  PMs  arranged  in  bicep  and  tricep  configurations. 
Repperger  et  al.  [7]  design  a  variable  structure  controller  for  a 
PM  to  perform  asymptotic  position  tracking. 

Fuzzy  control  [8-11]  has  been  successfully  used  in  many 
commercial  and  industrial  applications  in  recent  years.  The 
fuzzy  model  reference  learning  control  (FMRLC) 
methodology  was  developed  from  the  general  ideas  in 
linguistic  self-organizing  control  and  conventional  model 
reference  adaptive  control  [12].  FMRLC  has  a  learning 
ability  that  improves  the  closed-loop  performance  over  time 
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by  using  measured  data  from  the  plant  operation  to  update  the 
controller.  The  FMRLC  technique  has  been  used  in  the 
control  of  cargo  ship  steering  [13]  and  has  also  been 
employed  to  improve  the  performance  of  an  anti-skid  braking 
system  [14]. 

In  this  paper,  we  utilize  ideas  from  FMRLC  to  design  a 
learning  controller  that  results  in  length  tracking  for  a 
pneumatic  muscle.  The  controller  is  essentially  a  PID  type, 
with  adjustable  fuzzy  proportional  and  derivative  parts,  and  a 
nonfuzzy  integral  branch.  The  fuzzy  PD  consequent 
singleton  locations  are  dynamically  adjusted  by  a  fuzzy 
inverse  model.  A  novelty  of  this  controller  is  that  the  inverse 
model  has  for  one  of  its  inputs  the  interior  pressure  of  the 
PM.  This  helps  closed-loop  performance  in  this  case  because 
the  PM  inflation  dynamics  are  different  from  its  deflation 
dynamics. 

This  paper  is  arranged  as  follows.  Section  2  gives  a 
currently-derived  mathematical  model  of  the  PM.  Section  3 
describes  the  details  of  the  fuzzy  learning  controller  used  for 
position  tracking.  Section  4  presents  the  experimental  results 
mA  discussion,  and  Section  5  contains  conclusions. 

II.  DYNAMIC  Model  OF  Pneumatic  Muscle 

Extensive  efforts  to  model  the  PM  have  been  underway  in 
the  Human  Effectiveness  Laboratory  at  Wright  Patterson 
AFB  in  Ohio  [15].  In  this  study,  the  PM  was  modeled  by  a  3- 
element  mechanical  mode!  shown  in  Fig.  2,  consisting  of  a 
contractile  element,  dashpot,  and  spnng.  These  three 
elements  all  have  pressure-dependent  coefficients. 

The  PM  was  hung  vertically  with  a  mass  attached  at  the 
lower  end  Fig.  3  portrays  the  PM  being  inflated  due  to  an 
increase  in  pressure.  When  inflated,  it  shortens  which  exerts 
force  to  lift  the  mass.  Let  y  =  0  be  the  position  of  the  mass 
when  the  PM  is  completely  deflated.  It  is  shown  in  [15]  that 
the  dynamic  motion  of  the  PM  can  be  modeled  as 

My  + By  + Ky  =  F—Mg  (1) 

where  M  is  the  mass  lifted  by  the  PM,  B  is  the  coefficient  of 
viscous  friction,  K  is  the  spring  coefficient,  F  is  the  force 
exerted  by  the  contractile  element,  and  g  is  the  acceleration  of 
the  gravity.  From  (1),  when  Mg  =  F  and  ^  =  >>  =  0,  the 
mass  is  idle,  i.e.  no  motion  of  the  mass  occurs.  For  F  >  Mg , 
the  right  hand  side  of  (1)  becomes  the  forcing  function  for 
the  system  and  the  mass  rises. 

According  to  [15],  B,  K  and  F  are  dependent  on  the 
internal  pressure  of  the  PM  and  are  given  as: 

/F  =  3.77/’ -0.0138/*^ 

/:  =  32.6  +  1.21F 

5.75  +  0.272P  (Inflation) 

3.41  -  0.03 1 6P  (Deflation) 

where  P  is  the  gauge  pressure  inside  the  PM  in  psi.  The 
coefficients  specified  in  (2)  are  valid  in  the  range 
0  <  P  <  90  psi .  Note  that  the  coefficient  B  in  deflation  is 
smaller  than  in  inflation.  The  reason  is  because  in  deflation 


(2a) 

(2b) 

(2c) 
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K 


Contractile 
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Figure  2  -  Three  element  system  used  to 
model  PM 


the  PM  system  vents  against  a  constant  atmospheric  pressure. 
During  inflation,  however,  the  pressure  buildup  is  in  a  closed 
volume  and  the  forcing  function  has  to  fight  against  itself  as 
it  inflates.  Thus,  during  inflation,  the  net  B  term  is  larger. 

Note  from  (2)  that  the  contractile  force  F,  viscous 
damping  coefficient  B  and  spring  coefficient  K  are  functions 
of  F,  which  is  the  control  variable,  i.e.  P  is  the  independent 
variable  which  can  be  commanded  by  Ae  controller. 
Therefore,  in  order  to  express  the  PM  model  in  terms  of  the 
control  variable  F,  we  rewrite  (1)  as 

+  F(F)y  +  A:(F)y  -  F(F)  -Mg  (3) 

which  becomes 

My  -F  B{P)y  +  (32.6-1-1.2  IP);;  =  3 .77P  -  0.0 1 38P^  -  Mg 

(4) 

with  B  as  in  (2c). 

We  observe  that  (4)  is  not  in  a  form  that  is  solvable  by 
standard  nonlinear  adaptive  control  techniques.  For  instance, 
many  nonlinear  adaptive  control  results  rely  on  the  Use  of 
parameter  adaptive  control  for  obtaining  asymptotically  exact 
cancellation  in  linearizing  control  laws  [16],  [17].  Well- 
known  results  exist  for  plants  that  are  decouplable  by  static 
state  feedback,  i.e.  plants  of  the  form 

x  =  f{x)-¥g{x)u  (5a) 

y  =  h{x)  (5b) 

where  xeSt",  u  and  y  are  the  scalar  input  and  output 
respectively  and  /,  g  are  infinitely  differentiable  functions. 
However,  these  results  are  not  applicable  to  (4)  due  to  the  P^ 
term  and  the  fact  that  B  is  different  for  inflation  than  for 
deflation.  Hence,  it  is  difficult  to  control  this  PM  by  using 
conventional  nonlinear  adaptive  control  techniques. 

III.  FUZZY  Learning  Control 

Fuzzy  control  offers  several  advantages  over  conventional 
control  techniques.  An  important  advantage  for  problems 
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Figure  3  -  Pneumatic  muscle  driving  a  mass. 


such  as  tracking  control  of  PMs  is  that  the  plant  model  is  not 
required  to  be  in  any  particular  form,  or  even  that  the  form  is 
known.  Fuzzy  control  is  rather  based  on  plant  performance. 
An  extension  of  fuzzy  control  is  the  fuzzy  learning  control 
methodology.  In  this  method,  the  fuzzy  controller  is  not 
fixed,  but  rather  it  is  dynamically  adjusted  by  a  fuzzy  inverse 
model.  The  inverse  model  incorporates  expert  knowledge 
about  how  the  plant  should  behave  and  how  the  controller 
should  be  adjusted  if  the  plant  does  not  behave  as  desired. 
The  inverse  model  dynamically  adjusts  the  locations  of  the 
consequent  centers  of  the  fuzzy  controller  according  to  the 
measured  behavior  of  the  plant. 

Our  fuzzy  learning  controller  is  shown  in  Fig.  4.  It  has 
two  main  parts:  the  fuzzy  controller  and  the  learning 
mechanism,  which  includes  the  fuzzy  inverse  model  and 
knowledge-base  modifier.  We  explain  each  of  these  below. 


A .  Fuzzy  PD +/  Learning  Controller 

The  controller  is  of  PID  type  with  a  fuzzy  PD  part  and  a 
nonfuzzy  integral  branch.  The  fuzzy  part  has  inputs 
e{kT)  =  r{kT)-y{kT)  and  c{kT)^{e{kT)-e{{k-\)T))IT 
where  r{kT)  is  the  reference  signal  to  be  tracked,  y{kT )  is 
the  PM  length,  and  T  is  the  sampling  interval.  The  controller 
output  is  the  commanded  PM  pressure  p{kT)  .  The  integral 
branch  of  the  controller  is  nonfuzzy  because  we  have  no 
expert  knowledge  about  how  to  adjust  consequent  centers  for 
this  branch  and  fuzziness  is  not  needed  for  any  other  reason 
(e.g.  making  the  integral  path  nonlinear). 

For  the  fuzzy  (i.e.  PD)  part  of  the  controller,  we  define  1 1 
fuzzy  sets  on  the  e  universe  of  discourse  ( (e),  z  =  1, . . . ,  1 1 ) 
and  1 1  fuzzy  sets  on  the  c  universe  of  discourse 
(  Ai.(c\  z  =  1, . . . ,  1 1 ).  These  fuzzy  sets  are  characterized  by 
1 1  equally-spaced  symmetrical  triangular  membership 
functions  centered  at  [-1,  -.8,  -.6,  -.4,  -.2,  0,  .2,  .4,  .6,  .8,  1] 
which  form  partitions  of  unity  on  their  respective  universes. 

The  controller  rule  base  therefore  consists  of  121  rules, 
each  rule  with  a  separate  consequent.  Therefore  we  will  have 
121  singleton  output  memberships  with  the  consequent 
membership  for  rule  z  at  ,  z  =  1, . . . ,  1 2 1 .  The  input 
membership  functions  are  fixed  and  are  not  tuned. 

Using  minimum  T-norm  for  calculation  of  premise  values 
and  center-average  defuzzification,  the  output  of  the  PD  part 
of  the  controller  is  calculated  as 


Figure  4  -  Fuzzy  learning  PD  + 1  tracking  controller 
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Pppm- 

where  the  summation  is  taken  over  all  121  rules  and  b/ikT) 

‘th 

denotes  the  output  membership  function  for  the  i  rule  at 
time  it?’.  The  centers  fc,  (^7’),  /  =  1, ....  121  are  dynamically 
adjusted  by  the  fuzzy  inverse  model  and  the  learning 
mechanism. 

The  integral  branch  is  a  standard  nonfuz^  integral 
function  of  the  tracking  error.  A  constant  multiple  of  the 
integral  of  the  error  is  added  to  the  fuzzy  PD  output  making 
the  controller  essentially  a  PID  controller  in  which  the  P  and 
D  parts  are  fuzzy  and  adjustable,  while  the  1  part  is  nonfuzzy 
and  fixed. 


B.  Learning  Mechanism 

The  learning  mechanism  consists  of  a  fuzzy  inverse 
model  and  knowledge-base  modifier.  The  fuzzy  inverse 
model  is  a  fuzzy  system  with  inputs 

PM  internal  pressure  gypP{kT) .  The  inverse  model  output 
M){kT)  determines  the  amount  that  the  fuzzy  controller 
output  singletons  are  changed  at  each  time  step.  We  choose 
pressure  to  be  the  third  inverse  model  input  since  pressure  has 
a  direct  effect  on  PM  characteristics  (2).  Simulations  indi^te 
that  controller  P  and  D  gains  should  be  larger  at  low  PM 
pressures  than  at  high  PM  pressures.  This  is  due  to  the  fact 
that  it  requires  more  pressure  input  to  effect  a  given  amount 
of  shortening  when  the  PM  is  deflated  than  when  it  is  inflated 

The  e  and  c  input  universes  for  the  inverse  model  each 
have  11  fuzzy  sets  characterized  by  11  equally-spaced 
symmetrical  triangular  membership  functions  identical  to 
those  of  the  controller.  The  third  input  p  to  the  inverse  model 
has  a  universe  of  discourse  consisting  of  11  fuzzy  sets 
characterized  by  11  equally-spaced  symmetrical  tnangular 
membership  functions  centered  at  [0,  .1,  .2,  .3,  .4,  .5,  .b,  ./, 
8  9,  1]  forming  a  partition  of  unity.  Therefore,  the  inverse 
model  has  11x11x11  =  1331  rules. 


The  output  fuzzy  sets  for  the  inverse  model  are  17 
singletons  spaced  as  [-1,  -.75,  -.5,  -.4,  -.3, -.2, -T,  -.05,  0,  .05,. 

1  2,  .3,  .4,  .5,  .75,  1]  (see  Fig.  5).  The  spacing  is 

nonuniform  to  give  more  change  in  commanded  input 
pressure  when  the  PM  internal  pressure  is  low  (i.e.  inflated  a 
small  amount,  which  corresponds  to  longer  lengths)  and  j^s 
at  high  pressures  (more  inflation,  shorter  lengths).  Ine 
output  Ab(kr)  of  the  inverse  model  is  calculated  similarly  to 
the  output  of  the  fuzzy  controller  of  the  previous  section,  i.e. 
minimum  T-norm  and  center  average  defuzzification. 


Two  representative  parts  of  the  inverse  model  rule  base 
are  shown  in  Figs.  6a  and  6b.  For  example,  when  the  error 
and  derivative  error  are  small  and  positive  and  the  pressure  is 
low  the  output  membership  function  linguistic-numeric  value 
is  7,  corresponding  to  a  positive  addition  of  +0.75  (see 
highlighted  box  in  Fig.  6a).  This  means  the  controller  output 
singletons  should  be  increased  by  a  relatively  large  positive 
increment  to  insure  y(kT)  will  not  continue  to  decrease.  On 


Figure  5  -  Output  singleton  fuzzy  sets  for  inverse 
model  (linguistic-numeric  values 
above  memberships) 


the  other  hand,  for  the  same  values  in  error  and  derivative 
error,  but  high  interna!  pressure,  the  value  is  only  4, 
corresponding  to  an  addition  of  only  +0.3  (see  highlighted 
box  in  Fig.  6b),  which  means  we  need  less  than  half  as  much 
adjustment  in  controller  output  singletons  when  the  PM 
internal  pressure  is  high. 

The  knowledge-base  modifier  modifies  the  PD  part  of  the 
fuzzy  controller  to  improve  tracking  performance.  The 
knowledge-base  modifier  changes  the  knowledge-base  of  the 
fuzzy  controller  by  adjusting  only  the  output  singletons  of  the 
rules  that  are  "on"  at  time  kT  for  the  PD  part  of  the  controller 
according  to  the  update  law: 

bf(kr)  =  bAk-l)T)  +  Ab(kD  (7) 

Note  that  at  each  time  step  only  a  few  controller  output 
singletons  are  updated,  and  the  others  are  left  unchanged. 
Specifically,  no  more  than  4  singletons  are  changed  for  a 
single  time  step  in  our  learning  controller. 

IV.  Experimental  Results  and  Discussion 

The  PM  was  loaded  with  a  weight  of  65  pounds.  The 
fuzzy  learning  controller  was  implemented  with  a  sampling 
time  of  1/64  seconds.  The  controller  gains  used  were  1  > 
g.  =0.1,  gp=100.  The  inverse  model 

gains  were  chosen  to  be  gyp  =  .01  and  gy/,  =  0.02 . 

The  initial  position  of  the  mass  is  0  in.,  corresponding  to 
the  PM  fully  deflated  and  extended.  Figs.  7  and  8  show 
typical  tracking  performance. 

The  error  in  tracking  performance  is  due  to  the  fact  that 
the  supporting  structure  for  the  PM  assembly  is  not 
absolutely  rigid,  giving  rise  to  vibrations  in  the  PM  motion 
(see  Figure  7  especially  at  deflation),  the  fact  that  PM  length 
and  pressure  measurements  are  not  exact,  and  the  fact  that  the 
valves  used  for  inflation  and  deflation  are  not  ideal.  It  should 
be  remembered  that  PMs  are  typically  used  in  opposing  pairs 
(e  g  bicep/tricep),  which  would  probably  remove  most  of  the 
bouncing  seen  in  Figure  7.  Therefore,  the  setup  used  in  these 
experiments,  i.e.  a  single  PM  hanging  vertically,  is  somewhat 
unrealistic. 

It  can  be  observed  from  Figs,  7  and  8  that  fairly  accurate 
tracking  begins  during  the  first  1-2  seconds.  In  general, 
Igciffiing  (as  opposed  to  tracking)  occurs  over  a  longer  time. 
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and  controller  output  singletons  converge  after  30  -  45 
seconds,  depending  on  the  input.  Output  singleton 
convergence  is  not  necessary  for  accurate  tracking.  This 
phenomenon  is  also  present  in  nonfuzzy  model  reference 
adaptive  controllers  [18].  Tracking  performance  can  be 
expected  to  improve  over  time  as  different  situations  (i.e. 
combinations  of  e,  c,  and  p)  are  encountered  and  learning 
becomes  more  complete. 

V.  Conclusions 

The  problem  addressed  in  this  paper  is  position  tracking 
control  for  a  pneumatic  muscle  actuator.  Since  the 
coefficients  in  the  PM  are  poorly  known  and  vary  with  time, 
we  use  a  fuzzy  PID-type  tracking  controller  with  learning 
ability.  The  controller  consists  of  an  adjustable  PD  fuzzy  part 
and  a  parallel  nonfuzzy  integral  branch.  Tracking  is  accurate 
after  a  few  seconds  of  operation,  although  singleton 
convergence  has  not  yet  occurred. 
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Figure  7  -  Experimental  tracking  performance. 


Figure  8  -  Experimental  tracking  perfonnance 
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Figure  6b  -  One  page  of  inverse  model  rule  base  for  high  PM  internal  pressure 
(p  linguistic-numeric  values  5-10) 
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Abstract  -  This  paper  presents  robust  motion  control  of 
pneumatic  muscle  actuators  arranged  in  bicep/tricep  pair 
configuration  to  deal  with  system  and  environmental 
uncertainties  using  the  sliding  mode  approach.  A 
mathematical  model  is  derived  for  an  arm  with  PMs  in 
bicep/tricep  pair  configuration.  A  sliding  mode  controller 
is  designed  to  yield  asymptotic  tracking  of  elbow  angle. 
The  results  of  simulations  are  presented  to  demonstrate  the 
success  of  the  proposed  controller. 

1.  INTRODUCTION 

A  pneumatic  muscle  actuator  (PM)  consists  of  a 
cylindrical,  flexible  rubber  or  plastic  airtight  tube  inside 
a  braided  plastic  sheath  [1].  Its  characteristics  are 
similar  to  those  of  human  skeletal  muscles,  i.e.  PMs  are 
similar  in  size  and  force  capability,  and  force  is  only 
created  through  the  action  of  PM  contraction.  When  the 
tube  is  inflated,  it  widens  and  due  to  the  braided  sheath, 
the  entire  assembly  shortens.  The  force  exerted  when  the 
PM  shortens  is  quite  large  in  proportion  to  the  PM's 
weight.  Pneumatic  muscles  have  the  highest 
power/weight  ratio  (IW/g)  and  power/volume  ratio 

( 1  W/cm^  )  of  any  actuator.  Both  these  ratios  are  about  5 
times  higher  than  electric  motors  or  hydraulic  actuators. 

For  these  reasons,  PMs  have  found  application  in 
robotics,  and  for  strength  assistive  devices  for  humans. 
PMs  have  the  attractive  property  that  humans  are  at  a 
very  low  level  6f  risk  of  injuiy  in  event  of  failure  (i.e. 
bursting^  This  is  not  the  case  for  hydraulic  or  electric 
motor  actuators,  which  produce  a  far  greater  risk  of 
injury  to  the  human  user  around  the  device  in  the  event 
of  actuator  failure. 

However,  since  PMs  are  made  of  flexible  rubber  or 
plastic,  their  characteristics  vary  with  temperature  and 
their  temperature  varies  with  use,  which  makes  them 
more  difficult  to  control.  In  PM  control,  there  are 
typically  significant  discrepancies  between  the  actual 
plant  and  the  mathematical  model  developed  for 
controller  design.  This  difficulty  is  inherent  in  PM 
technology. 

To  ensure  the  required  performance  levels  exist 
despite  the  existence  of  plant/model  mismatches,  we  use 
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variable  structure  control  methodology  for  PM  control 
[2].  Sliding  mode  control  [3-9]  is  one  of  these  methods. 
Sliding  mode  is  a  high-speed  switching  feedback  control 
that  switches  between  two  values  based  upon  some 
switching  criteria.  Sliding  mode  control  drives  the 
nonlinear  plant’s  state  trajectory  onto  a  specified 
surface,  which  is  called  the  sliding  or  switching  surface. 

Since  sliding  controller  design  provides  a  systematic 
approach  to  the  problem  of  maintaining  stability  and 
consistent  performance  in  the  face  of  modeling 
imprecision,  it  is  currently  under  investigation  for  the 
control  of  PMs.  In  this  paper,  we  model  an  arm  with 
PM  actuators  in  bicep/tricep  pair  configuration  and 
utilize  sliding  mode  tracking  control  to  make  the  elbow 
angle  asymptotically  track  a  desired  reference  function 
of  time. 

This  paper  is  arranged  as  follows.  Section  2 
contains  the  derivation  of  a  mathematical  model  of  an 
arm  actuated  by  PMs  in  bicep/tricep  pair  configuration. 
Section  3  addresses  sliding  mode  control  of  limbs 
actuated  by  PMs  and  proposes  a  sliding  mode  controller 
for  PMs  in  bicep/tricep  pair  configuration.  Section  4 
presents  simulation  results  of  the  closed-loop  sliding 
mode  angle  control  of  the  arm  with  PMs  in  bicep/tricep 
pair  configuration.  Section  5  presents  a  discussion  of 
the  results,  and  Section  6  contains  conclusions. 

11.  Modeling  of  Limbs  with  PM  in  Bicep/Tricep  Pair 
Configuration 

Figure  1  shows  a  PM  hanging  vertically  actuating  a 
mass.  When  inflated,  the  PM  shortens  which  exerts 
force  to  lift  the  mass.  Let  y  =  0  be  the  position  of  the 
mass  when  the  PM  is  completely  deflated.  If  the  PM  is 
modeled  as  a  parallel  combination  of  a  spring,  a  dashpot, 
and  a  contractile  element  (as  in  [10]),  the  dynamic 
motion  of  the  system  of  Figure  1  can  be  modeled  as 

Aty-^By  +  Ky  =  F-Mg  (2.1) 

where  M  is  the  mass  lifted  by  the  PM,  B  is  the 
coefficient  of  viscous  friction,  K  is  the  spring 
coefficient,  F  is  the  force  exerted  by  the  contractile 
element,  and  g  is  the  acceleration  of  the  gravity.  Thus 
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Figure  1  ^  PM  hanging  vertically  actuating  a  mass 


the  total  upward  force  exerted  on  the  mass  by  the  PM  is 
F-By-Ky . 

According  to  [10],  B,  K  and  F  are  dependent  on  the 
internal  pressure  of  the  PM  and  are  given  as: 

F  =  3.77P-0.138P^  (2.2a) 

a:  =  32.58  +  1.209?  (2.2b) 


5  = 


'5 .748  +  0.27 1 9?  (Inflation) 
3.41  -  0.03 1 6P  (Deflation) 


(2.2c) 


where  P  is  the  internal  gauge  pressure  of  the  PM. 

Figure  2  shows  an  arm  actuating  a  mass,  with  PMs 
in  the  position  of  a  bicep/tricep  pair.  The  upper  arm 
remains  stationary  as  the  PMs  expand  and  contract, 
moving  the  forearm.  The  upper  ends  of  the  bicep  and 
tricep  are  attached  to  a  motionless  reference  point.  The 
mass  is  held  at  the  end  of  the  forearm  (i.e.  hand).  The 
forearm,  which  is  considered  massless,  is  attached  to  the 
upper  arm  by  a  frictionless  planar  revolute  joint.  The 
PMs  are  attached  to  the  forearm  at  point  A,  which  is  a 
distance  a  from  the  joint.  The  distance  from  the  center 
of  mass  of  the  load  to  the  joint  is  L  The  forearm  is  free 


to  rotate  through  an  angle  9 ,  where  ?  =  0°  corresponds 
to  the  tricep  being  fully  shortened  while  bicep  is  fully 

lengthened,  and  ?  =  180°  corresponds  to  the  tricep  being 
fully  lengthened  while  the  bicep  is  fully  shortened.  For 
simplicity,  we  will  assume  the  PM  force  always  acts 
parallel  to  the  forearm.  This  is  valid  as  long  as  9  is  not 
close  to  either  of  its  extremes. 

Let  subscripts  b  denote  bicep  PM  coefficients  and 
subscripts  t  denote  tricep  PM  coefficients.  Also  let  x* 
denote  bicep  PM  length  and  x,  denote  tricep  PM  length. 
Since  the  total  clockwise  torque  exerted  by  the  bicep  on 
the  elbow  is  AT^x^jnsin?,  the  total 

counterclockwise  torque  exerted  by  the  tricep  on  the 
elbow  is  {F^  -S,x^  -K^x^y  and  the  counterclockwise 

torque  imparted  to  the  elbow  by  gravity  is  MgLsinB , 
the  dynamics  of  the  system  of  Figure  2  are  described  by: 

I9  =  {F^-B,,x^-^xt,)a%m9 
-K^x^)r-MgLs\i\0 

where  I  =  MI?  is  the  moment  of  inertia  of  the  mass 
about  the  elbow  and  g  is  the  acceleration  of  gravity. 
Note  that,  since  the  bicep  force  is  multiplied  by  asm  9 

the  bicep  loses  controllability  at  ^  =  0'’  and  ^  =  180'*. 
Thus,  the  arm  should  be  kept  away  from  these  extremes. 
The  tricep  does  not  have  this  limitation  because  its  cable 

always  makes  an  angle  of  a  =  sin  *  with  the  arm 
regardless  of  9 . 

As  in  (2.2),  we  use  F  =  F\P  ^  K-  ATq  ,  and 
B  =  where  F|=3.77,  A’o  =32.58, 

Ki  =1.209,  and  B^.B^  depend  on  whether  the  PM  in 
question  is  being  inflated  or  deflated,  as  follows: 


Figure  2  -  Arm  with  PMs  in  bicep/tricep  pair 
configuration 


^0  = 


5.748,  inflation 
3.41,  deflation’ 


f  0.2719,  inflation 
I -0.0316,  deflation 


We  have  neglected  the  term  in  F  because  it  tends  to 
be  negligible  in  the  pressure  range  of  operation  of  the 
PM.  The  internal  bicep  and  tricep  pressures  and  P^ 
are  the  control  variables  that  can  be  independently 
commanded  by  the  controller  as  inputs  to  the  system. 
Thus  this  is  a  2-input  system.  Note  that  the  PM 
dynamics  depend  on  whether  the  PM  is  being  inflated  or 
deflated.  We  see  that  (2.3)  is  in  an  unusual  form  for 
control  since  the  control  inputs  enter  into  the  system 
through  the  coefficients  F ,  F ,  and  K  and  not  as  a 
separate  term. 
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To  convert  this  2-input  system  to  a  single-input 
system,  let  us  assume  that  the  bicep  and  tricep  internal 
pressures  are  given  by 

P^=/’0+Ap  (2.4a) 

F,  =Po-Ap  (2.4b) 

where  Pq  is  a  nominal  constant  pressure  and  Ap  is  the 
change  in  pressure  which  is  now  the  independent  control 
input.  Note  that,  with  PM  pressure  defined  as  in  (2.4), 
one  PM  inflating  always  corresponds  to  the  other 
deflating.  Therefore,  one  set  of  B  parameters  (say 
inflation)  will  apply  to  one  of  the  PMs  while  the  other 
set  (deflation)  applies  to  the  other  PM  at  a  given  time. 
When  the  inflation  status  of  the  PMs  changes,  they  trade 
B  parameters.  We  denote  the  bicep  B  coefficients  as 
Bqi,  and  ,  and  the  tricep  B  coefficients  as  Bq,  and 

Bu- 

When  either  PM  is  fully  lengthened,  its  length  is 
defined  as  zero,  and  when  it  is  fully  shortened,  its  length 
is  defined  as  -2a  (i.e.  x  is  the  amount  of  PM 
shortening).  Therefore,  from  Figure  2,  the  bicep  length 
is  xjy  =  a(cos  d-\)  and  the  tricep  length  is 

=  -a(\  +  cos  O) .  Combining  (2.3)  with  the  above 

relationships  for  F,  B,  and  K,  we  obtain  the  following 
2nd  order  equation  describing  the  system  of  Figure  2: 


where 

e  =  f{0.0)+  b{0,0yp 

6 

(2.5) 

f{e,0)=  icji(0,0) 

1  =  1 
.  6 

(2.6a) 

ldifi{0,0) 

j  =  l 

(2.6b) 

In  (2.6),  /|  =  sin  F ,  /2  =  sin 0(cosd  - 1) , 

/3  =(9sin^(9,  /4  =1,  f^=\  +  cos0,  =(9sin(9, 

cj  =  (aF,Fo  -  MgL)II ,  C2  =  (Kq  +  KxPQ)a^  II , 

C3  =  (-^0*  - .  C4  =  -F^Por/I , 

C5  =  (ATo  +  >  ^6  =  ^-^01  -  BuPo)ar/I , 

r/j  =  F\al  I ,  =K\a^  1 1 ,  d-^=  . 

04  =F\rlI ,  d^  =  -K^arU  ,  and  d^  =  Bi,arl I . 

The  model  (2.5)  is  now  in  a  form  suitable  for  sliding 
mode  control. 


III.  Sliding  Mode  Controller  for  PMs  in 
Bicep/Tricep  Configuration 

Pneumatic  muscle  actuators  are  time-variant, 
nonlinear  and  uncertain  due  to  compressibility  of  air, 
static  and  Coulomb  friction,  and  payload  variations.  The 
mass  A/ manipulated  by  the  PM  can  be  expected  to  vary 
significantly  from  use  to  use.  The  coefficients  B  and  K 
vary  with  PM  temperature.  Also,  the  physical  distances 
r,  a,  and  L  may  not  be  precisely  known.  Therefore,  it  is 
difficult  to  develop  a  precise  model  for  PM  systems  such 
as  that  of  Figure  2.  Linear  control  methods  often  lead  to 
poor  performance  for  pneumatic  systems. 

Sliding  mode  control  (SMC)  has  been  used  for  robot 
control,  including  robotics  utilizing  PMs  as  actuators 
[11],  [12].  In  SMC,  rather  than  controlling  the  states  of 
the  system  directly,  the  desired  error  behavior  of  the 
system  is  specified  in  terms  of  a  sliding  surface  in  the 
state  space.  A  discontinuous,  so-called  variable 
structure  control  law  is  used  to  drive  the  state  to  the 
sliding  surface.  The  state  then  moves  along  this  surface 
so  that  the  desired  tracking  performance  is 
asymptotically  achieved.  This  method  is  robust  to 
model  uncertainty,  because  of  the  discontinuous 
feedback. 

In  practice,  imperfect  switching  can  result  in 
unacceptably  high  control  activity  and  chattering  which 
can  excite  unmodeled  dynamics.  For  this  reason 
continuous  feedback  is  often  used  in  a  neighborhood  of 
the  sliding  surface  so  that  the  state  remains  within  a  so- 
called  boundary  layer  near  the  sliding  surface. 

The  typical  sliding  mode  controller  is  composed  of 
a  nominal  part,  similar  to  a  feedback  linearizing  or 
inverse  control  law,  and  additional  terms  aimed  at 
dealing  with  model  uncertainty.  The  problem  of 
designing  a  robust  control  law  by  the  sliding  mode 
approach  can  be  stated  as  follows:  given  a  desired 
sliding  manifold  function  of  the  system’s  states 
(.s(x)=  O),  determine  a  control  input  which  satisfies  the 
sliding  condition.  Then  the  desired  performance  can  be 
achieved  by  an  involved  reduced-order  dynamics  in  the 
sliding  regime.  We  show  that  sliding  mode  gives  a 
viable  alternative  for  high  performance  robotics 
applications  involving  PMs. 

Because  of  our  imperfect  knowledge  of  coefficients 
Fi,  ATo,  ATi,  ^0,  and  5|,  we  must  assume  that 

and  b{0,0)  in  (2.5)  are  imprecise.  Assume  the 
extent  of  the  imprecision  on  f{6,d)  can  be  bounded  by 
a  known  continuous  function  of  0  and  0 .  Similarly,  we 
assume  that  the  extent  of  the  imprecision  on  b{6,d)  can 
be  bounded  by  a  known,  continuous  function  of 
6>and  ^ .  The  control  problem  is  to  get  the  joint  angle 
0{t)  to  track  a  desired  trajectory  0^{t)  in  the  presence 

of  model  imprecision  on  and  b(0,d). 
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Let  /(i^,  o)  be  the  estimate  of f  and  letF(^,^)  be  a 
positive  function  such  that 

\f-f\<F  (3.1) 

Further  assume  the  control  gain  b{d,  &)  is  unknown  but 
of  known  bounds  (0  <  <b<  b^^^  )  where 

and  ^max  depend  on  6  and  0 .  Then  define  the 
gain  margin 

(3.2) 

Let  the  estimation  ofb  be  ^  =  -y/^min^max  * 

Let  be  a  smooth  function  of  time  that 

represents  a  desirable  angular  trajectory  for  the  PM- 
actuated  arm.  Consider  the  sliding  surface  s  =  0, 

s  =  0  +  A0  ,  where  0=6-0^  is  the  tracking  error  and 

A  is  a  scalar  design  parameter. 

Let  ‘sat’  denote  the  saturation  function  defined  as 


corresponds  to  a  maximum  change  in  length  of  the  PM 
of  6  inches. 

Case  1:  The  desired  trajectory  for  the  joint  is 

0^{t)  =  y -H  0.5(sin(2;^j/)  +  sin(2;^20  +  sin(2;r/'3/)) 

(4.1) 

with  /i  =  0.02Hz,  /2  =  0.05Hz,  and  /3  =0.09z.  This 
trajectory  spans  joint  angles  from  approximately  30°  to 

1 5  0°  during  the  period  /  =  0  -  60  sec. 

Let  =  10  and  77=  10  (chosen  by  trial  and  error  to 
yield  good  performance).  We  choose  the  boundary  layer 
thickness  as  O  =  1 .  It  can  be  shown  [4]  that  ^  can 

be  regarded  as  the  guaranteed  tracking  precision. 
Therefore,  for  this  simulation  we  have  £*  =  0,1 . 

Assume  that  the  true  values  of  f{0,d)  and  b{9,6) 
in  (2.5)  are  known  to  fall  within  ±30%  of  the  best 
estimates  we  have  of  them,  i.e.  f{9,6)  and  h{6,9) . 

Then  we  have  F  =  0.3|/| ,  =1.3^,  b^^^  =  0.7^ , 


sat(y)  = 


sgn(y),  otherwise 


and  the  gain  margin  p  is  determined  as  1 .86  by  (3.2). 
(3.3)  The  sliding  control  input  to  the  PM  is  given  in  (3.4) 

with  parameters  defined  as  above.  For  the  simulation, 


Then  the  sliding  mode  control  law  is  given  by 

tsp  =  (Ap-A:sat(5/0))6”^  (3.4) 

where  lsp  =  {-f  +  9^  -  ^9) ,  ^  >  ^(F*  +  7)  +  (^  - 1)| Ap| , 
77  is  a  positive  constant,  and  cb  is  the  thickness  of  a 
“boundary  layer,”  which  is  a  neighborhood  of  the  sliding 
surface  introduced  to  reduce  control  chattering. 

It  can  be  shown  [4]  that  the  control  law  (3.4) 
guarantees  that  the  system  trajectory  reaches  the 
boundary  layer  in  a  finite  time  whatever  the  initial  state, 
and  inside  the  boundary  layer  constrains  trajectories  to 
stay  inside  it  for  all  later  time  and  approach  a 
neighborhood  of  ^  =  0  asymptotically  as  t  goes  to 
infinity.  Asymptotic  tracking  to  within  a  guaranteed 
accuracy  is  therefore  obtained  in  spite  of  modeling 
errors. 

IV.  Simulation  Results 
The  system  of  Figure  2  with  PMs  in  bicep/tricep 
pair  configuration  is  simulated  using  a  4th-order  Runge- 
Kutta  algorithm  with  a  step  size  of  0,01  seconds.  Let  L  = 
1 .5  ft,  a  ==  3  inch,  M=  1  slug,  and  r  =  2  inch.  Since  a  = 
3  inch,  the  full  travel  of  the  forearm  from  ^  =  0  (arm 
fully  straightened)  to  9=7t  (arm  fully  bent) 


the  actual  f{9,9)  and  b{9y9)  terms  were  randomly 
chosen  to  lie  within  ±30%  of  their  modeled  values. 
Figure  3  shows  the  tracking  errors  for  three  different  sets 
of  /  and  b  within  this  range.  It  is  seen  that  for  all 
systems  the  tracking  error  is  within  predicted  bounds, 

with  areas,  of  maximum  error  corresponding  to  9 
changing  signs,  i.e.  places  where  the  arm  motion  has  to 
change  direction.  This  is  especially  noticeable  when  the 
arm  must  change  from  downward  motion  to  upward 
motion. 

Figure  4  shows  a  typical  control  effort  Ap  with 

Pq  =  50  Ib/in^  ,  It  is  evident  that  input  pressure  varies 
smoothly  without  any  obvious  chattering.  Therefore,  by 
using  the  sliding  mode  controller,  the  PM  system 
achieves  desired  performance  with  good  tracking 
precision  and  no  obvious  chattering  for  all  three  systems 
which  may  represent  the  true  arm  with  PMs  in 
bicep/tricep  pair  configuration. 

To  investigate  the  robustness  of  the  sliding 
controller  to  changing  masses,  we  increased  the  mass  M 
to  2  slugs,  i.e.  an  increase  of  100%.  Figure  5  shows 
tracking  errors  for  three  different  actual  arms  randomly 
chosen  within  the  ±30%  range.  Tracking  is  again 
within  predicted  bounds.  Figure  6  shows  a  typical 
control  effort  when  M  =  2 .  Note  that  the  control  effort 
is  larger  than  the  M  =  1  case,  which  is  to  be  expected 
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since  a  heavier  mass  is  being  moved.  The  mass  M  could 
be  increased  more,  but  very  heavy  masses  require  the 
input  pressure  to  be  outside  the  allowed  range  of  PM 
internal  pressure  (0-130  psi).  This  limitation  is  not  the 
sliding  controller’s  shortcoming  however;  it  is  merely  an 
acknowledgement  that  the  PM  internal  pressure  must  be 
kept  within  reasonable  bounds  to  protect  against  actuator 
failure  (bursting).  If  more  force  is  desired,  several  PMs 
can  always  be  placed  in  parallel. 

It  is  noted  from  Figure  6  that  since  Pq  =  50  psi ,  the 

values  of  Ap  would  require  tricep  pressure  P,  to  be 
negative  (2.4b).  This  is  impossible,  and  in  such  a  case, 
we  would  simply  set  P,  =  0 .  The  simulation  reflects 
this.  The  fact  that  tricep  pressure  is  mostly  zero  when 
M  =  2  results  from  the  heavier  mass  exerting  enough 
downward  force  to  track  the  downward  parts  of  the 
reference  trajectory  without  needing  the  tricep  to  help 
pull  the  arm  down. 

Case  2:  To  further  verify  the  sliding  mode  controller, 
another  simulation  is  performed  to  track  a  pseudo-square 
wave  signal  with  a  typical  system  within  the 
±30%  range.  Here,  the  desired  trajectory  is 

l;r,  sin(2;/(t-3))>i 

=  +  ^  sin(2;z/'(/  -  3)),  |sin(2^(/  -  3))|  ^  ^ 

IfT,  sin(2^(/-3))<— ^ 

(4.2) 

with  /  =  0.1  Hz.  This  function  transitions  between 
constant  values  of  ^  and  3^  smoothly  rather  than 

with  discontinuous  jumps.  For  the  design  parameters, 
we  used  A  =  10 ,  77  =  10 ,  and  d>  =  0.3 .  Therefore,  the 
O 

tracking  accuracy  is  s  =  —  =  0.03 . 


Time  (seconds) 


Time  (seconds) 


Figure  4  -  Typical  control  effort  Ap ,  M  =  \  (Case  1 ) 


Time  (seconds) 


Figure  6  -  Typical  control  effort  Ap ,  M  =  2  (Case  1) 


Figure  3  -  Tracking  errors  for  three  possible  actual  arms, 
A/  =  l  (Casel) 
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Time  (seconds) 


Figure  7  -  Tracking  error  (Case  2) 

From  Figure  7,  the  joint  angle  trajectory  is  seen  to 
follow  the  desired  one  with  acceptable  error  except  at 
the  times  of  rapid  transition  between  the  two  constant 
values.  This  is  attributed  to  the  fact  that  in  the 
simulation  the  PM  pressures  are  constrained  to  lie  within 
the  range  [0J30]psi  to  better  conform  to  actual  PM 

operation.  Therefore,  the  needed  input  pressure  dictated 
by  the  sliding  mode  controller  is  not  applied  and 
tracking  accuracy  is  lost. 

V.  Discussion 

The  properties  of  PMs  in  bicep  and  tricep 
configurations  have  already  been  studied  individually  in 
the  context  of  nonlinear  adaptive  control  [13].  In  the 
present  paper,  actuation  via  bicep/tricep  pair  is 
developed.  Sliding  mode  control  methods  have  been 
applied  to  this  problem  since  fixed  controllers  are  less 
robust  to  parameter  changes  than  sliding  mode  ones.  In 
the  case  of  time-invariant  and  well-known  coefficients, 
traditional  methods,  i.e.  PID  may  give  good  results. 
However,  if  coefficients  or  physical  quantities  change 
significantly,  the  fixed  PID  cannot  stabilize  the  system. 
With  the  sliding  mode  controller  given  in  Section  3, 
good  tracking  performance  is  obtained  even  in  the 
presence  of  modeling  uncertainties.  The  two  trajectories 
considered  in  this  paper  are  used  because  they  mimic 
two  common  working  situation  of  the  PMs.  Trajectory 
(4.1)  represents  a  movement  of  the  mass  in  a  smooth 
trajectory.  Trajectory  (4.2)  represents  the  task  of 
holding  the  mass  in  a  stationary  position  and  then  lifting 
it  up  or  dropping  it  down  suddenly.  In  both  cases,  the 
sliding  mode  controller  can  work  with  desirable 
performance  of  good  tracking  precision  and  little 
chattering. 

VI.  Conclusions 

A  dynamic  model  for  an  arm  actuated  by  two 
pneumatic  muscles  in  bicep/tricep  pair  configuration  has 
been  derived.  Assuming  certain  degrees  of  inaccuracy 
in  our  knowledge  of  the  PM  coefficients,  a  sliding  mode 
controller  was  designed  to  achieve  good  tracking 


performance.  In  order  to  eliminate  chattering,  the 
control  action  was  also  designed  to  be  smoothed  to 
achieve  a  trade-off  between  control  bandwidth  and 
tracking  precision.  Simulation  results  demonstrate  the 
effectiveness  of  sliding  mode  control  for  PM 
applications. 
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ABSTRACT— This  paper  studies  the  evolutionary  design  of  a  fuzzy  P+ID  controller  for  an  actual 
pneumatic  muscle  actuator  system.  The  control  of  pneumatic  muscles  is  a  challenging  problem 
because  of  their  high  degree  of  nonlinearity,  time-varying  parameters,  and  uncertainty.  A  fuzzy 
P+ID  controller  is  constructed  using  an  incremental  fuzzy  logic  controller  in  place  of  the 
proportional  term  in  a  conventional  PID  controller.  Several  controller  parameters  arc  optimized  via 
an  evolutionary  algorithm.  The  optimization  is  performed  using  a  recurrent  neuro-fuzzy  dynamic 
model  of  the  muscle  rather  than  the  muscle  itself.  Control  results  arc  presented,  where  the  control 
objective  is  to  force  muscle  length  to  follow  a  reference  signal  under  a  load.  After  evolutionary 
design,  excellent  tracking  performance  is  obtained  with  the  real  muscle  without  the  need  for  further 
tuning  of  controller  parameters.  The  tracking  performance  is  compared  to  that  of  another  fuzzy 
controller. 

Key  Words:  Fuzzy  control,  evolutionary  algorithms,  ncurofuzzy  modeling,  pneumatic  actuators 


1,  INTRODUCTION 

Pneumatic  muscles  (PMs)  are  a  special  type  of  pneumatic  actuator  made  of  an  airtight  rubber  tube  (or 
bladder)  inside  a  strong  braided  plastic  sheath  with  a  helical  weaving,  as  shown  in  Figure  1.  A  valve 
controls  the  flow  of  gas  from  a  pressurized  source  into  the  bladder.  When  the  valve  opens,  gas  is  delivered 
into  the  bladder  and  the  bladder  widens  and  shortens.  The  resulting  contractile  force  is  quite  large  in 
relation  to  the  PM’s  size.  If  inflation  and  deflation  can  be  accurately  controlled,  the  PM  length  and  force 
exerted  can  also  be  controlled. 

In  comparison  with  other  actuators,  pneumatic  muscles  possess  some  unique  advantages.  First,  PMs 
have  extremely  high  power/weight  (IkW/kg)  and  power/volume  ratio  (IW/cm^).  They  are  easily  and 
quickly  constructed  with  few  tools  from  cheap,  commonly-available  materials.  Also,  pressurized  gas  can 
be  created  as  a  byproduct  of  easily  produced  chemical  reactions,  obviating  the  heed  for  heavy  equipment 
such  as  compressors,  batteries,  generators  or  motors.  If  weight  is  not  a  concern,  tanks  filled  with 
pressurized  gas  can  be  used  to  supply  power  to  the  PM.  Finally,  PMs  are  actuators  which,  if  they  fail,  do 
so  in  a  relatively  safe  manner.  PM  failure  is  the  result  of  the  bladder  bursting  under  relatively  low  internal 
pressures,  typically  under  100  psi.  Therefore,  in  contrast  to  hydraulic  or  electrical  actuators,  a  PM  failure 
is  not  likely  to  injure  humans  or  damage  equipment,  making  PMs  good  candidates  for  use  in  close 
proximity  to  humans  and/or  delicate  equipment. 

A  PM  can  actuate  linear  motion  within  its  active  range.  For  PMs  which  are  8  - 12  inches  long  and  2  - 
3  inches  in  diameter,  the  PM’s  active  range  is  a  few  inches,  which  is  roughly  comparable  to  human 
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Figure  1.  Pneumatic  Muscle  Construction. 

muscles.  Therefore,  in  addition  to  other  configurations,  PMs  have  been  Used  in  human-wom  exoskeletons 
for  strength  augmentation  of  military  personnel  or  rehabilitation  for  stroke  victims  and  other  injured  or 
disabled  patients.  PMs  are  also  finding  applications  in  robotics  systems  in  place  of  electrical  actuators  for 
driving  joints  [1  -  4].  They  are  excellent  candidates  for  space,  underwater,  and  other  applications  in  which 
light  weight,  difficulty  of  supplying  power,  and  obtaining  replacement  parts  are  concerns. 

Nevertheless,  PM  use  has  been  limited  because  they  are  very  nonlinear  and  time-varying  (due  to 
differing  inflation  and  deflation  dynamics),  making  accurate  position  and  force  control  challenging 
problems.  Also,  variations  in  load,  the  pressure  of  supply  and  other  mechanical  parameters  cause 
uncertainty  in  the  positional  accuracy  of  the  PM  system. 

So  far,  several  advanced  control  strategies  have  been  utilized  to  control  pneumatic  muscles,  including 
adaptive  control  [5  -  7],  variable  stmcture  control  [8  -  10],  predictive  control  [11],  gain  scheduling  control 
[12]  and  neural  network  control  [13].  These  have  met  with  varying  degrees  of  success.  Fuzzy  logic-based 
control  is  known  to  excel  with  complex  and  highly  nonlinear  systems  with  parameter  uncertainty. 
Moreover  the  design  of  fuzzy  controllers  is  not  model-based,  making  them  well-suited  for  the  control  of 
pneumatic  muscles. 

Several  fuzzy  approaches  to  PM  control  have  been  tried  [14, 15].  In  [14],  Chan  et  al.  proposed  a  fuzzy 
model  reference  learning  controller  (FMRLC)  and  applied  it  to  force  the  muscle  length  to  follow  a 
reference  signal.  In  [15],  Carbonell  et  al.  proposed  a  nonlinear  fuzzy  back-stepping  controller  for  set-point 
regulation  of  a  pneumatic  muscle  actuator. 

In  general,  fuzzy  controllers  have  many  parameters  to  be  determined.  The  tuning  of  these  parameters 
is  typically  done  by  a  time-consuming  trial  and  error  process.  Therefore,  automatic  tuning  of  design 
parameters  is  advantageous.  Some  studies  [16  -  18]  on  evolutionary  computation  have  shown  that 
evolutionary  algorithms  (EAs)  provide  powerful  ways  to  efficiently  search  in  poorly  understood,  irregular 
spaces  inspired  by  the  mechanism  of  natural  evolution.  Following  their  successful  applications  in  many 
difficult  optimization  problems,  EAs  are  becoming  increasingly  used  in  the  design  of  fuzzy  systems.  Work 
in  EA-based  design  of  fuzzy  systems  can  be  divided  into  two  categories:  rule-base  construction  and  rule- 
base  tuning  For  rule-base  construction,  EAs  are  used  for  finding  rules  [19,  20]  and  determining  the 
number  of  rules  [21]  fi-om  a  data  set.  For  rule-base  tuning,  EAs  are  used  for  optimizing  membership 
functions  in  the  fuzzy  system  according  to  some  performance  criterion  [22]. 
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In  this  paper,  we  design  a  fuzzy  P+ID  controller  for  an  actual  pneumatic  muscle  system  located  in  the 
Human  Sensory  Feedback  Laboratory  at  Wright-Patterson  Air  Force  Base  in  Dayton,  Ohio.  For  this 
purpose,  a  recurrent  neuro-fuzzy  model  of  the  PM  is  developed,  and  on  the  basis  of  this  model  a  modified 
evolutionary  algorithm  is  used  to  tune  the  controller  parameters.  The  resulting  controller  is  then  applied  to 
the  actual  PM  with  excellent  tracking  results. 

This  paper  is  organized  as  follows.  Section  2  studies  PM  dynamic  modeling  via  a  recurrent  neuro- 
fuzzy  network.  Section  3  proposes  a  fuzzy  P+ID  control  strategy  for  the  PM  and  an  evolutionary 
algorithm  is  developed  for  tuning  controller  parameters.  Section  4  is  devoted  to  the  identification  of  a 
fuzzy  model  and  the  design  of  the  fuzzy  P+ID  controller  for  the  PM.  In  addition,  some  detailed  control 
results  are  presented.  Finally,  some  remarks  and  conclusions  will  be  drawn  in  Section  5. 


2.  NEURO-FUZZY  MODELING  FOR  PNEUMATIC  MUSCLES 

Several  closed-form  mathematical 
models  have  been  developed  for  PMs  in 
various  configurations  [23  -  26].  The 
physical  configuration  considered  in  our 
research  is  that  of  a  PM  hanging  vertically 
actuating  a  mass  (Figure  2).  In  [25],  the 
dynamics  of  a  pneumatic  muscle  system  in 
the  configuration  of  Figure  2  are  described  by 
a  second-order  nonlinear  differential  equation 
with  differing  inflation  and  deflation 
dynamics.  Inflation  and  deflation  dynamics 
are  different  because  in  deflation  the  PM 
system  vents  against  a  constant  atmospheric 

pressure;  during  inflation,  however,  the  Figure  2.  PM  physical  configuration, 
pressure  buildup  is  in  a  closed  volume  and 
the  forcing  function  has  to  fight  against  itself 
as  it  inflates.  Thus,  deflation  is  easier  than  inflation. 

In  order  to  model  this  system  via  a  recurrent  neuro-fuzzy  inference  system  (RNFIS)  [27],  we  choose 
the  general  model  form 

K^)=/(K^-lXK^-2),4^-l),^(^-2))  +  e(^)  (1) 


where  k  is  the  time  step,  y  is  the  length  of  the  muscle,  x  denotes  the  voltage  input  to  the  valve, /is  some 
nonlinear  function,  and  e  denotes  the  modeling  error.  The  neuro-fuzzy  system  used  to  realize  (1)  is  shown 
in  Figure  3  for  the  case  of  three  fuzzy  sets  on  each  input  universe. 

The  recurrent  neuro-fuzzy  inference  system  in  Figure  3  is  six-layered.  Layer  A  is  an  input  layer,  and 
neurons  in  this  layer  represent  fuzzy  variables  found  in  the  fuzzy  inference  machine.  These  are  volts 
applied  to  the  valve  and  PM  length  in  the  past  two  time  steps.  Layer  B  is  a  term  layer.  Neurons  in  this 
layer  are  called  term  nodes.  They  represent  fuzzy  sets  corresponding  to  linguistic  values  (small,  medium, 
large,  etc.)  of  the  input  variables.  These  fuzzy  sets  are  characterized  by  asymmetrical  triangular 
membership  functions  in  the  form: 

V-m  +  CTi  ^  ^ 

- —  m-<Ti<v<m 

— -  m<v<m  +  <Jji  (2) 

0  otherwise 
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Figure  3.  RNFtS  used  to  model  PM. 
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where  V  represents  an  input  to  the  fuzzy  system  (i.e.  y{kA\ y{k-2\  x(kA\  or  x(A-2)),  w  is  the  value  of  V 
at  which  the  membership  function  attains  its  maximum  value  of  1,  and  <7 1  and  CT are  the  left  and  right 
spreadsofthetriangles,  i.e.  the  support  of  a  particular  membership  ju{v)  is  <V<m  +  cr^}. 

Layer  C  is  a  rule  layer.  Neurons  with  label  H  calculate  the  degree  of  match  for  parts  of  the  premise  of  each 
rule,  which  evaluates  the  degree  of  activation  of  each  rule.  The  label  11  indicates  that  the  product  T-norm 
is  used  to  implement  conjunction  in  the  premise  of  each  rule.  Layer  D  contains  a  neuron  that  calculates  the 

reciprocal  of  sum  of  degrees  of  match  of  rules.  Also  in  this  layer  are  weights  C'  appearing  in  the 
consequents  of  the  rules.  The  neuron  in  layer  E  is  an  aggregation  neuron,  which  sums  the  weighted 
premise  values  for  each  rule.  Layer  F  is  the  defuzzification  layer.  Its  neuron  forms  the  product  of  the 
weighted  premise  values  and  inverse  sum  of  premise  values.  The  output  of  the  neuron  in  this  layer 
represents  the  estimate  of  the  PM  length  at  the  current  time  instant. 

The  task  of  constructing  an  accurate  RNFIS  for  PM  modeling  is  divided  into  two  phases:  structure 
learning  and  parameter  tuning.  The  structure  learning  phase  is  aimed  at  finding  the  structure  of  the  RNFIS, 
such  as  the  membership  functions  and  the  rules  in  the  rule  base.  Structure  learning  involves  fuzzy 
identification  fi-om  data.  The  parameter-tuning  phase  consists  of  optimization  of  all  weights  in  the  RNFIS, 
i.e.  input  and  output  membership  function  centers  and  spreads.  In  the  parameter-tuning  phase,  we  use  the 
well-known  error  backpropagation  algorithm  (BP)  [27]  for  tuning  model  parameters.  We  will  not  address 
BP  as  applied  to  the  network  of  Figure  3  here.  We  now  briefly  discuss  the  structure  learning  phase. 


2.1  Structure  Learning  via  VISIT 

At  the  beginning,  the  RNFIS  is  empty  without  any  rules  or  fuzzy  sets.  A  simple  self-organizing 
algorithm  known  as  the  Variable  Input  Spread  Inference  Training  algorithm  (VISIT)  [28]  is  used  to 
perceive  structure  features  fi'om  a  sequence  of  training  data.  VISIT  is  a  variation  of  the  well-known 
Modified  Learning  From  Examples  (MLFE)  algorithm  for  identification  of  fuzzy  systems  from  data. 
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In  VISIT,  the  membership  functions  can  be  any  shape  in  general,  as  long  as  they  are  convex.  In  this 
paper,  we  will  use  asymmetrical  triangular  input  membership  functions  and  singleton  output  membership 

functions. 

To  begin  the  VISIT  algorithm,  training  constants  cTq  ,  ^ ^  w  are  specified  by  the  user.  The 

constant  (Tq  is  the  initial  value  for  the  spread  of  the  first  membership  function  on  each  universe.  The 

constant  A  determines  when  a  new  membership  function  is  created  on  a  universe  of  discourse.  If  a  new 
training  point  is  within  a  A  -cut  of  an  existing  membership  function  on  a  universe,  a  new  membership 
function  is  not  created  on  that  universe.  The  constant  £  is  the  maximum  identification  error  which  is 
tolerated  before  a  new  rule  and  new  memberships  are  added  to  the  fuzzy  system.  Finally,  w  determines  the 
amount  of  overlap  between  adjacent  memberships  on  a  universe.  We  now  give  the  VISIT  algorithm. 


2.2  VISIT  Algorithm 

1 .  Set  /  =  1 .  Get  the  first  training  pair  tp^  =  {x^,X2,  y)  .  On  each  input  universe  of  discourse 

form  a  fuzzy  set  characterized  by  a  membership  function  (xy  )  with  center  at  nij  =  Xj  and 

spread  (Tq  .  On  the  output  universe  of  discourse  form  a  fuzzy  set  characterized  by  a  singleton 

membership  function  with  support  >>' . 

2.  Add  the  rule 

If  Xi  is  aI  and  X2  is  A2  and  ...  and  x„  is  A},  then 3/ is  . 

3.  If  there  are  no  more  training  pairs,  the  training  cycle  is  completed.  Otherwise,  increment  /  and  get  the 
next  training  pair  tp'  ={xi,  X2,  ■■  ■,  X„,  y)  . 

4.  If  )-  y'  <  ^  where  fifp‘ )  denotes  the  crisp  output  of  the  existing  fuzzy  system  evaluated  at 

the  new  training  pair  tp' ,  discard  tp'  and  go  to  3. 

5  On  theyth  input  universe,  evaluate  all  membership  functions  at  Xj.  Call  the  fuzzy  set  whose 

membership  function  is  maximum  Af^ .  If  x'j  €  A,  —  CUt{Af^^  ),  form  a  new  fuzzy  set  Aj  on 

theyth  universe  characterized  by  a  membership  function  with  center  =  Xy  and  left  and  right 

spreads  fTy  and  (7y  as  in  step  6  below.  On  the  output  universe  form  a  new  fuzzy  set  B^^ 

characterized  by  a  singleton  membership  function  with  support  y' . 

6.  On  each  input  universe  on  which  a  new  membership  function  was  added  in  5,  the  spreads  of  each  side 
of  the  new  membership  function  and  its  nearest  neighbors  on  the  left  and  right  are  re-calculated  as 
follows.  The  right  spread  of  the  new  membership  function  and  the  left  spread  of  the  nearest  right 
neighbor  are  re-calculated  as 


1 


new 


CJ;  = -  m:  —m  / 


fiRn 


(3) 


The  left  spread  of  the  new  membership  function  and  the  right  spread  of  the  nearest  left  neighbor  are  re¬ 
calculated  as 


<^j=- 

w 


(4) 
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where  denotes  the  nearest  existing  center  to  the  right  of  and  denotes  the  nearest 

existing  center  to  the  left  of  * 

7.  Add  the  rule 


^1 


if  there  is  no  other  rule  in  the  rule  base  that  is  inconsistent  (i.e.  same  premise,  different  consequent) 
with  this  rule,  where  the  fuzzy  sets  in  the  premise  are  the  ones  maximized  by  the  corresponding  inputs 

8.  If  there  is  another  rule  (rule p  generated  from  tp^  )  in  the  rule  base  that  is  inconsistent  with  the  rule 

formed  in  7  (rule  q  generated  from  the  present  training  pair  tp' ),  define  fuzzy  system  fpifq)  to  be 

the  fuzzy  system  with  rule p  {q)  included  in  the  rule  base  but  mle  q  ip)  omitted  from  the  rule  base. 
Retain  rule  p  and  omit  rule  q  if 


|/p(</)-/|+|/p(9>')-y|<|A{'/)-/H4fe’')-.>'' 


(5) 


Otherwise,  include  rule  q  and  omit  rule  p. 

9.  Go  to  3. 


2.3  Comment 

It  is  possible  that  the  rule  to  be  added  in  step  7  has  the  same  premise  but  a  different  consequent  than  an 
existing  rule,  i.e.  the  newly  created  rule  is  inconsistent  with  an  existing  rule.  In  such  a  case,  we  need  to 
determine  which  rule  provides  a  better  match  to  the  data  so  we  know  which  rule  to  retain  in  the  rule  base 
and  which  rule  to  omit.  To  do  this,  we  create  two  fuzzy  systems,  each  with  only  one  of  the  inconsistent 
rules  retained,  but  with  the  other  inconsistent  rule  omitted.  In  (5),  a  comparison  of  the  two  fuzzy  systems 
is  made  on  both  training  pairs  that  generated  the  two  inconsistent  rules.  The  fuzzy  system  having  less  total 
identification  error  for  both  training  pairs  is  retained  and  the  other  omitted.  In  this  way,  we  determine 
which  of  the  two  inconsistent  rules  does  a  better  job  of  describing  the  data  and  retain  it  in  the  rule  base, 
omitting  the  other. 

3  EVOLUTIONARY  FUZZY  P+ID  CONTROL 

Fuzzy  controllers  are  gaining  popularity  in  the  control  community  due  to  their  capabilities  of  dealing 
with  uncertainty,  but  their  systematic  design  is  stilt  an  open  problem.  A  typical  fuzzy  controller  found  in 
the  literature  often  has  many  parameters,  some  of  which  are  intercorrelated  and  co-dependent.  It  is  often 
impossible  to  perceive  the  relationship  between  closed-loop  performance  and  the  parameters  of  the  fuzzy 
controller.  This  increases  the  difficulty  of  design  of  fuzzy  controllers.  For  these  reasons,  we  propose  an 
evolutionary  fuzzy  P+ID  controller  for  PM  control. 


3.1  Fuzzy  P+ID  Controller 

A  fuzzy  P+ID  controller,  shown  in  Figure  4,  is  constructed  using  an  incremental  fuzzy  logic  controller 
with  output  Axy  (it)  = /^Z/C(e,e)  in  place  of  the  proportional  term  in  a  conventional  PID  controller. 

The  PM  input  is  a  voltage  Jc(A:)  which  is  expressed  in  incremental  form  as 


Ax(A:)  =  x{k)  -  x(k  - 1) 


=  K  pS^Ax f  (Jc)  +  KjTeik)  -  Kp, 


y{k)-2y{k-\)  +  y{k-l) 
T 


(6) 
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Figure  4.  Fuzzy  P+ID  controller  for  PM. 

In  (6),  Kp,  Kj,  and  are  PID  control  gains,  S^.  is  the  output  scaling  gain  of  the  fuzzy-P 

subsystem,  and  T  is  the  step  size.  In  general,  scaling  gains  Sg  and  S^g  in  Figure  4  are  chosen  so  that 
values  of  SgC  and  Sjgk  lie  in  the  interval  [-1,  1],  and  is  often  chosen  to  be  equal  to  the  reciprocal  of 

Sfjg  .  In  our  approach,  these  scaling  gains  will  be  determined  by  an  evolutionary  algorithm. 

To  reduce  the  number  of  control  parameters,  we  design  membership  functions  for  the  fuzzy-P 
controller  with  the  following  assumptions: 

1.  e{k)  and  e{k)  universesofdiscoursehavefivetriangularmembershipfunctions,  i.e.  NB,  NS,  ZO, 

PS  and  PB.  These  attain  their  maximum  value  of  unity  —  /3 ,  —  OT ,  0,  CX ,  and  respectively  and 
form  a  partition  of  unity.  This  number  of  memberships  is  chosen  to  keep  the  evolutionary  optimization 
task  relatively  simple  while  maintaining  good  control  performance. 

2.  Ar y  (^)  universe  has  five  singleton  membership  functions  located  at  —  ^(NB),  —  0;(NS),  0(ZO), 

a(PS)and  yff(PB). 

A  general  rule  in  the  fiizzy-P  part  of  the  P+ID  controller  can  be  written  as: 

RULEi{i  =  \,2,...,2S)‘ 

If  e{k)  is  nl {e{k))  and  e{k)  is  ju^g{e(ky)  then  Ax^(^  1)  =  C' 

where  //^  and  are  one  of  the  fuzzy  sets  NB,  NS,  ZO,  PS  and  PB.  The  consequent  C'  takes  on 
values  in  the  set  {-  /?,  -  a,  0,  a,  $].  The  rule  base  of  the  fuzzy-P  part  of  the  controller  is  determined 
from  expert  knowledge  and  shown  in  Table  1.  The  crisp  output  of  the  fuzzy-P  system  then  can  be 
calculated  as: 

A»y(i  +  l)  =  J -  m 

Eft 

/ 

where  i  =  1, . .  .25  and  //,•  is  the  premise  value  of  rule  i. 
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Table  I.  Rule  base  for  fuzzy  P+ID  controller. 


Av, 

1 _ ^ _ 

NB 

NS 

ZO 

PS 

PB 

e 

PB 

ZO 

PS 

PB 

PB 

PB 

PS 

NS 

ZO 

PS 

PB 

PB 

ZO 

NB 

NS 

ZO 

PS 

PB 

NS 

NB 

NB 

NS 

ZO 

PS 

NB 

NB 

NB 

NB 

NS 

ZO 

There  are  three  kinds  of  parameters  in  the  fuzzy  P+ID  controller:  fuzzy  control  parameters,  PID 
parameters,  and  scaling  gains  for  the  fuzzy-P  subsystem.  The  quantities  a  and  f  are  fuzzy  control 
parameters.  The  proper  tuning  of  CC  and  >5  will  greatly  improve  performance.  In  general,  a  is  related  to 
steady-state  accuracy  and  is  related  to  the  response  speed.  Small  a  implies  high  steady-state  accuracy; 
large  speeds  up  the  transient  response  [29,  30].  The  PID  parameters  Kp,  Ki ,  and  Kp)  determine 
the  control  resolution  and  the  stability  of  the  controlled  system.  According  to  [29,  30],  the  proven 
sufficient  conditions  show  that  the  system  stability  is  not  destroyed  when  a  fuzzy  P+ID  controller  takes  the 
place  of  a  conventional  PID  controller.  This  means  that  closed-loop  stability  is  guaranteed  regardless  of 
the  choice  of  a  and  p .  The  quantities  Sg ,  5^^ ,  and  are  scaling  gains  for  the  fuzzy-P  subsystem. 

Control  parameters  and  scaling  gains  will  be  determined  by  an  evolutionary  algorithm.  The  PID 
parameters  will  be  determined  using  the  Ziegler-Nichols  tuning  method. 

3.2  Evolutionary  Design  of  Fuzzy  P+ID  Controller 

The  design  of  fuzzy  controllers  is  complicated  by  the  fact  that  we  do  not  know  the  relationship 
between  membership  functions  and  the  control  performance.  To  avoid  a  tedious  trial  and  error  tuning 
process,  the  following  evolutionary  algorithm  is  proposed  to  automatically  tune  the  five  parameters  a , 

P>  Sgy  ^de  ,  and  of  the  fuzzy  P  +1D  controller. 


3.3  Evolution  Algorithm: 

1 .  Initialize  randomly  an  even  number  of  P  individuals. 

2.  Evaluate  the  fitness  of  the  P  individuals  using  the  fitness  function / defined  in  (8)  below. 

3.  Select  the  P/2  fittest  individuals  and  copy  them  as  the  first  P/2  individuals  of  the  next  generation. 
Discard  the  other  individuals. 

4.  Apply  the  mutation  operator  defined  in  (9)  below  to  the  individuals  selected  in  3  and  generate  another 
P/2  individuals  for  the  next  generation. 

5.  Go  to  Step  2  and  repeat  until  the  stopping  criterion  is  met. 

A  real  vector  p{n)  =  [«(«),  P{n),  Sg(n),  Sjg(n),  S^(n)]e  9?^  is  used  in  the  above  algorithm 

to  represent  the  nth  individual  in  the  population,  which  can  be  evolved  by  a  set  of  mutation  operations  and 
the  elitist  selection  operation.  In  each  generation,  the  individuals  with  fitness  in  the  top  50%  are  used  to 
create  the  population  for  the  next  generation.  For  control  applications,  the  well-known  ITAE  criterion  is 
often  used  to  form  the  fitness  function: 

1 


k=\ 


eik) 


r(k) 


kT 


where  K  is  the  total  number  of  time  steps  in  the  calculation,  e(k)  is  the  tracking  error  at  the  Mi  time 
instant,  r{k)  is  the  desired  output  at  the  Idh.  time  instant,  and  T  is  the  step  size.  Note  that  the  reference 
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signal  r{k)  is  always  greater  than  zero.  Thus,  the  larger  the  value  of  the  fitness  function,  the  better  the 


tracking  performance. 

We  define  the  following  mutation  operator  for  creation  of  new 
previous  generation: 


individuals  from  the  fittest  of  the 


Py.»+i  ("+%)  =  + <^yPy('')“P 


f 

J  n 


m 


/r 


m 
max  J 


(9) 


where  y  =  1, . . 5  indexes  parameters  in  individuals,  m  is  the  generation  number,  w  —  1, . . is  the 

index  for  individuals  with  the  P/2  largest  fitnesses  in  the  current  generation,  P  is  the  population  size,  <7 j  is 
a  learning  coefficient  which  determines  the  amount  of  change  of  parameter;  from  generation  to  generation, 
Pj  is  a  Gaussian  random  number  €  [—1,  1] ,  is  the  fitness  of  individual  n  in  the  with  generation, 

and  largest  fitness  in  the  mth  generation. 

In  consideration  of  the  constraints  0  <  «  <  <  1 ,  the  following  techniques  are  used  to  deal  with 

illegal  individuals  possibly  generated  by  the  mutation  operators: 

1 .  If  («  +  ^)  <  0 .  +  =  Pain),  where  pa  is  a  Gaussian  random  number 

e  [o,  l]. 

2.  If  (/7  +  %)  >  1 ,  An+i  (n  +  %)  =  Pfi(n)  ,  where  p^  is  a  Gaussian  random  number 

e[0,  l]- 

3.  l{  a„+i(n  +  ^)>  Pm+](n  +  ^),  exchange  the  values  of  a^+iin  +  ^)  and  Pm+li'^'*' 


4.  EXPERIMENTAL  RESULTS  AND  DISCUSSION 

In  this  section,  we  design  a  fuzzy  P+ID  controller  for  a  PM  hanging  vertically  actuating  a  mass  as 
shown  in  Figure  2.  '  The  system  is  located  in  the  Human  Sensory  Feedback  Laboratory  at  Wright-Patterson 
Air  Force  Base.  The  design  procedure  for  the  controller  is  as  follows.  First,  a  recurrent  neuro-fuzzy  model 
of  the  PM  in  the  form  of  (1)  is  derived  from  experimental  data  taken  from  the  physical  system  using  the 
VISIT  and  BP  algorithms  as  outlined  in  Section  2. 

This  RNFIS  model  is  then  used  in  place  of  the  actual  PM  system  in  the  tuning  of  the  five  parameters  of 
the  fuzzy  controller  ( «,  5*)  using  the  evolutionary  method  discussed  in  Section  3  and  also 

to  determine  the  PID  gains  Kp,  Kj,md  Kq.  The  PID  gains  are  found  via  the  well-known  Ziegler- 
Nichols  tuning  method  applied  to  the  identified  recurrent  neuro-fuzzy  system.  We  discuss  each  of  these 
below. 


4.1  Dynamic  Modeling  of  PM  from  Test  Data 

We  collected  several  sets  of  input-output  data  from  the  PM  and  used  these  for  training  md  test  data  for 
modeling.  A  typical  data  set  is  shown  in  Figure  5.  Using  these  data,  a  RNFIS  model  as  in  Figure  3  was 
developed.  In  the  structure  learning  phase,  we  used  VISIT  with  f  =  0. 1 ,  w  =  0.85 ,  A  =  0.27 ,  and 
(7o  =  1-17  •  Tlies®  parameters  were  chosen  to  yield  a  fuzzy  system  with  a  relatively  small  number  of 
rules  and  fuzzy  sets  on  each  imiverse  while  also  giving  relatively  small  identification  error.  This  provided 
a  good  initial  system  structure  to  be  tuned  further  via  BP. 

In  the  parameter  learning  phase,  we  use  a  BP  algorithm  [27]  for  tuning  model  parameters  (i.e.  centers 
and  spreads  of  the  model  membership  functions  identified  by  VISIT).  After  approximately  20,000 
iterations,  the  BP  training  was  ended  at  a  mean  square  error  of  MSE=0.001 1,  where 
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M5£  =  -^j;(yW-7W)  <>“> 

A=l 

In  (10),  y{k)  is  the  output  of  the  identified  RNFIS  model,  y{k)  is  the  experimentally  measured  PM 
length  (see  Figure  5(b)),  and  ^1  is  the  number  of  input-output  pairs  used.  The  model  output  is  compared 

with  that  of  the  true  PM  in  Figure  6.  The  sampling  time  in  Figure  6  is  second,  yielding 

K\  «  3600 .  Obviously  from  Figure  6,  the  model  well  describes  the  PM  dynamics.  This  process  yielded 
the  following  32  rules  together  with  the  input  membership  functions  plotted  in  Figure  7. 

4.2  Rule-base  for  RNFIS  Model  of  PM 

1.  lfv(A--l)ismfnand;<A-2)ismf2i  and  4^-2)  is  mfj,  and  Jc(A-l)  is  mf4i  theayik)-  1.900 
2  lf  y(k- 1 )  is  1 1  v(^-2)  is  mfo  i  and  x(k-2)  is  mfs?  and  x(k- 1 )  is  mfti  then  y(k)  -2.154 

3.  If  v(^-l)  is  mfn  andV(A-2)  ismfj,  andx(/:-2)  ismfa:  and4*-l)  is  mf42  then7(A:)  =  1.805 

4.  If  v(/t- 1 )  is  mf 1 2  and  y(k-2)  is  mf2 1  and  x(k-2)  is  mfsi  and  jc(A-  1 )  is  mf42  then  y(k)  =  0. 1 60 

5  Ifj'(A-l)  is  mfi2andj'(A-2)  is  mfji  and  jc(/:-2)  is  mf32  and  jc(A-1)  is  mf42  thenj'(A:)  —  2.481 

6  If  y(i^-l)  is  mfp  and  y(A:-2)  is  mf22  and  x(k-2)  is  mf32  and  x(A-l)  is  mf,,  then><A:)  =  2.478 

7.  If /(it-l)  is  mfi2 and;<A:-2)  is  mf22  and x(k-2)  is  mfsi  and x(k-])  is  mf,2  thenX*)  -  0.668 

8  If y(k- 1 )  is  mfi  ■>  and  y(k-2)  is  mf;  i  and  x(k-2)  is  mf32  and  x(k- 1 )  is  mf4i  then  y(k)  -  0.325 

9^  lf;;(/fc-l)ismf,2and;<A-2)ismf22andx(i-2)ismf3i  and  jc(/:-l)  is  mf,,  then ;^(A:)  =  1.001 
10  If /(/t-l)  is  mfp  &ndy{k-2)  is  mf22  and  x(k-2)  is  mf32  and  jc(/:-1)  is  mf43  then^^)  ■"  0.425 

IF  lfM*^-l)  is mf,2and;;(it-2)  is mf22  andx(k-2)  ismf33  and x(A-])  is mf4,  then.y(A)  =  2.179 

1 2 .  If 1 )  is  ®af 1 2  and  y(k-2)  is  mf22  and  x{k-2)  is  mfj  i  and  x(_k- 1 )  is  mf43  then  y{k)  —  1 . 1 63 

13.  lf>>(ifc-l)  is  mfi2 andX^-2)  is  mfii  and x(k-2)  is  mf33  and x{k-\)  is  mf42  then7(A:)  =  0.461 

14  If  Kit-l)  is  mfp  andjy(A-2)  is  mf22  and  Jc(A-2)  is  mfjs  and  x(/r-l)  is  mf43  then3^(A:)  =  0.876 

15.  lfy'(A:-l)  is  mfi3and:Kit-2)  is  mfjj  andac(*-2)  is  mf33  and  jc(A-l)  is  mf,3  then:H(A-)  =  0.573 

16.  lf><jfc-l)  is  nifi3and7(/:-2)  is  mf22  and  a:(A:-2)  is  mf33  andx(A-l)  is  mfii  thenX^)  -  0.829 

1 7  \fy{k- 1 )  is  mf]  3  and  y{k-2)  is  mf22  and  x(k-2)  is  mfs  1  and  x{k- 1 )  is  mfo  then  y(A:)  =  3 .594 

1 8  If  y(it- 1 )  is  mf  1 2  and  ^k-2)  is  mf22  and  xik-2)  is  mf3 1  and  x{k- 1 )  is  mfw  then  yik)  =  0.6 1 1 

19.  IfX^'l)  and  j^A-2)  is  mf22  and x{k-2)  is  mf33  and ^(A-l)  is  mf42  then^A:)  -  1.071 

20.  If  y(^"0  mfi I  and >>(^-2)  is  mf22  andx(/r-2)  is  mf3i  and  jc'(A:-1)  is  mfii  then7(A:)  —  2.271 

21.  If^A-l)  is  mfi2and>^A-2)  is  mf23  andac(A:-2)  is  mf32  and  jc(A-l)  is  mf44  thenj'(/r)  —  1.154 

22.  If  t<A:-1)  is  mfi3andX^-2)  is  mf23  andx(^-2)  is  mf32  and  jc(A:-l)  is  mf43  then7(A:)  =  1.225 

23 .  If  y(A:- 1 )  is  mfi3  and  y(k-2)  is  mf23  and  x{k-2)  is  mfsa  and  x{k- 1 )  is  mf42  then  y(k)  -  1 . 1 69 

24.  If  y(^'l)  is  “^'2  and34A-2)  is  mf23  and x{k-2)  is  mfsi  and  jc(^-1)  is  mfo  then>(A:)  =  0.534 

25.  ^^'(it-l)  is  mfi3and_v(A^-2)  isinf22  andx(A:-2)  is  mf32  andx(A-l)  is  mf42  then_y(A^)  “  0.685 

26.  If;<ifc-1)  is  mfii  and><A-2)  is  mfzi  ?taAxlk-2)  is  mf3i  and  jc(A-l)  is  mfi2  then^A^)  =  1.841 

27  If^i^-l)  is  mfi3and;<A-2)  is  mfjs  andx(A:-2)  is  mf33  and  jc(A:-l)  is  mfts  then;^(A)  =  0.587 

28’  If  y(it-l)  is  mf,3  and:K*-2)  is  mf-j  andx(A-2)  is  mf33  and  jc(A:-l)  is  mfn  thenj;(A:)  =  0.439 

29  If  v(M)  is  mfi3  and  y{k-2)  is  mfjs  and  x{k-2)  is  mf34  and  x(A-l)  is  mf42  then  j(A:)  -  0.607 

30!  If 3<M)  is  mf,3andj^(A:-2)  is  mf23  andx(*-2)  is  mfj:  andx(Ar-l)  is  mf44  then^*)  =  0.094 
31  If>4A:-l)  is  mfi3and_y(A:-2)  ismf23  andx:(A^-2)  is  mf33  and  jc(A-1)  is  mf42  then_y(A:)  -  0.893 

32!  lfX^-1)  is  mf,3  andj<*-2)  is  mf23  and  x{k-2)  is  mf33  and  x(A-l)  is  mfi,  theny(k)  =  2.043 

4  3  Evolutionary  Design  of  Fuzzy  P+ID  Controller  Using  Neuro-fuzzy  Model 

An  optimal  fuzzy  P+ID  controller  is  designed  for  the  pneumatic  muscle  via  the  evolutionary 

algorithm  of  the  previous  section  based  on  the  obtained  RNFIS  model  above.  Note  that  parameters  Ot,  fi , 

5  ,  Sde,  and  ^ck  were  optimized,  so  we  needn’t  specify  scaling  gains  in  advance  according  to  the  actual 

ranges  of  e  and  e .  PID  parameters  were  set  as  Kp  =  —0.7 ,  Kj  =  —0.22 ,  and  Kj)  —  —0.5  using  the 
Ziegler-Nichols  tuning  method  based  on  open-loop  tests  with  the  identified  RNFIS  model. 


Figure  7.  Input  membership  functions  for  the  RNFIS  model  of  PM. 


After  400  generations  with  a  population  of  P  =  120  ,  the  following  optimal  parameters  are  obtained; 
or*  =0.203,  >5*  =0.632,  S'*  =0.9982,  5^^=0.2232,  and  S^.  =1.002.  Corresponding 
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membership  functions  of  the  fuzzy  controller  are  plotted  in  Figure  8,  and  its  control  surface  is  plotted  in 
Figure  9. 

We  tested  the  controller's  ability  to  force  the  PM  to  track  a  reference  signal.  For  these  experiments  the 
pneumatic  muscle  was  loaded  with  a  mass  of  20  kg.  The  initial  length  of  the  muscle  is  5  cm, 
corresponding  to  the  PM  fully  deflated  and  extended.  For  comparison,  the  P+ID  controller  performance  is 
compared  to  that  of  a  fuzzy  model  reference  learning  controller  (FMRLC)  which  was  tuned  by  trial  and 
error  for  best  perfonnance  [14].  The  tracking  performance  of  both  controllers  is  plotted  in  Figure  10,  and  a 
comparison  of  the  tracking  errors  is  shown  in  Figure  11.  The  reference  signal  is  a  combination 
sinusoid/triangle  function. 


Figure  8.  Membership  functions  for  e  and  e . 


Figure  9.  Control  surface  for  fuzzy-P  part  of  controller. 
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Figure  1 1 .  Comparison  of  tracking  errors. 

From  Figure  1 1,  we  see  that  the  evolutionary  fuzzy  P+ID  controller  performance  is  superior  to  that  of 
the  FMRLC  i.e.  the  tracking  error  is  less.  To  compare  the  tracking  performances  rigorously,  we  use  the 
maximal  deviation,  defined  as 

MD  =  nun(j/'(A:)  -  (11) 

and  the  average  deviation,  defined  as 

<12) 

) 

where  k  ranges  over  all  time  samples,  r{k)  is  the  reference  signal,  jp(A:)  is  the  PM  length,  and  K2  is  the 
total  number  of  time  steps.  For  the  FMRLC,  the  maximal  deviation  is  MD  =  0.41  and  the  average 
deviation  is  AD  -  0.1173.  For  the  evolutionary  fuzzy  P+ID  controller,  we  have  MD  =  0.16  and  AD  = 
0.0558,  demonstrating  the  superiority  of  the  evolutionary  fuzzy  P+ID  controller. 

The  control  actions  exerted  by  the  two  controllers  in  the  above  tracking  performances  are  plotted  in 
Figure  12.  The  control  input  from  the  evolutionary  fuzzy  P+ID  controller  is  seen  to  be  less  than  that  from 
the  FMRLC.  Therefore,  the  evolutionary  fuzzy  P+ID  controller  attains  better  tracking  performance  using 
smaller  control  effort  than  the  FMRLC.  Because  the  control  signal  in  the  PM  system  corresponds  to  the 
volts  supplied  to  the  valve,  this  means  that  the  fuzzy  P+ID  controller  is  more  efficient,  i.e.  can  use  less 
energy  and  get  better  results.  This  can  be  valuable  in  applications  where  energy  is  limited,  such  as  use  in 
remote  locations,  underwater  or  spacecraft  applications. 

It  should  be  noted  that  the  FMRLC  used  for  these  experiments  was  tuned  over  many  trials  to  yield 
good  tracking.  To  be  fair,  however,  these  FMRLC  results  may  not  be  the  best  attainable;  better  tracking 
may  result  from  further  tuning.  In  contrast,  the  excellent  tracking  results  shown  in  Figure  10(b)  for  the 
evolutionary  fuzzy  P+ID  controller  were  obtained  upon  first  application  of  the  controller,  i.e.  no  tuning 
was  required.  Most  fuzzy  systems  require  extensive  tuning  before  they  perform  satisfactorily.  Therefore, 
the  combination  of  neuro-fiizzy  modeling  and  evolutionary  design  based  on  this  model  is  a  powerful 
design  tool  for  fuzzy  control. 
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Figure  12.  Comparison  of  control  actions. 


5.  CONCLUSIONS 

The  control  of  pneumatic  muscles  (PM)  is  difficult  due  to  their  complex  dynamic  characteristics  such 
as  nonlinearity,  uncertainty  and  some  time-varying  features  such  as  differing  inflation  and  deflation 
dynamics.  This  paper  studies  the  tracking  control  of  a  pneumatic  muscle  suspended  vertically  actuating  a 
mass.  The  controller  for  this  research  is  an  evolutionary  fuzzy  P+ID  controller  that  was  implemented  on 
the  Pneumatic  Muscle  Test  Station  at  the  Human  Sensory  Feedback  Laboratory  at  Wright-Patterson  Air 
Force  Base.  The  controller  was  designed  using  a  recurrent  neuro-fuzzy  model  of  the  PM  obtained  from 
test  data  taken  in  the  lab.  This  controller  is  compared  with  a  fuzzy  model  reference  learning  controller 
(FMRLC)  also  designed  for  this  task.  Tracking  performance  of  the  evolutionary  fuzzy  P+ID  controller  is 
seen  to  be  superior  to  the  FMRLC  performance.  Also,  tracking  performance  is  immediately  excellent  upon 
first  application  of  the  controller  with  no  tuning  of  the  controller  necessary. 
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Adaptive  Tracking  for  Pneumatic  Muscle  Actuators 
in  Bicep  and  Tricep  Configurations 

John  H.  Lilly.  Senior  Member,  IEEE 


Abstract— Adaptive  tracking  techniques  are  applied  to  pneu¬ 
matic  muscle  actuators  arranged  in  bicep  and  tricep  configura¬ 
tions.  The  control  objective  is  to  force  the  joint  angle  to  track  a 
specified  reference  path.  Mathematical  models  are  derived  for  the 
bicep  and  tricep  configurations.  The  models  are  nonlinear  and  in 
general  time-varying,  making  adaptive  control  desirable.  Stability 
results  are  derived,  and  the  results  of  simulation  studies  are  pre¬ 
sented,  contrasting  the  nonlinear  adaptive  control  to  a  nonadaptive 
PID  control  approach. 

Index  Terms — Exoskeletons,  nonlinear  adaptive  control,  pneu¬ 
matic  muscle  actuators  (PMAs). 

I.  INTRODUCTION 

Braided  pneumatic  muscle  actuators  (PMAs)  possess 
all  the  advantages  of  traditional  pneumatic  actuators 
(i.e.  cheapness,  quickness  of  response,  high  power/weight 
and  power/volume  ratios)  without  the  main  drawback  (i.e. 
compliance  or  sponginess).  For  this  reason,  they  are  finding 
increased  use  in  robotic  systems.  PMA  technology  is  currently 
under  study  for  use  in  exoskeleton  suits  to  be  worn  by  humans 
for  force  and/or  mobility  assistance.  A  difficulty  inherent  in  PM 
technology  for  use  in  precision  and/or  force  applications  is  the 
difficulty  in  controlling  them  precisely.  This  is  because  they  are 
nonlinear  and  time  varying  (i.e.,  since  they  are  made  of  flexible 
mbber  or  plastic,  their  characteristics  vaiy  with  temperature 
and  PM  temperature  varies  with  use).  For  this  reason,  adaptive 
control  approaches  are  currently  under  investigation  for  precise 
control  of  PMs. 

A  PMA  consists  of  a  cylindrical,  flexible  rubber  orplastic  air¬ 
tight  tube  inside  a  braided  plastic  sheath  (see  Fig.  1).  When  the 
tube  is  inflated,  it  widens  and  due  to  the  braided  sheath,  the  en¬ 
tire  assembly  shortens.  The  force  exerted  when  the  PM  shortens 
is  quite  large  in  proportion  to  the  PM’s  weight.  PMs  have  the 
highest  power/weight  ratio  (1  kW/kg  [1])  and  power/volume 
ratio  (1  W/cm^  [2])  of  any  actuator.  PMs  have  been  used  for 
years  in  robotics  to  perform  manipulation  or  precision  tasks 
(3]-[10].  Another  advantage  of  PM  actuators  is  the  ability  to 
make  them  autonomous.  They  are  extremely  lightweight  and 
can  be  made  independent  from  other  power  sources.  They  may 
be  energized  from  a  small  canister  of  gas  that  can  rapidly  create, 
from  a  chemical  reaction,  large  pressures  for  inflation  of  the 
muscle. 
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The  main  drawback  of  PMs  is  that  they  are  very  nonlinear  and 
time  varying,  making  them  difficult  to  control  precisely.  This 
paper  studies  the  closed-loop  control  of  PM  systems  when  used 
within  the  context  of  an  accurate  position-tracking  scenario.  The 
PM  system  is  inherently  a  passive  device  and  is  classified  as 
a  “soft  actuator’*  (due  to  its  compliance),  which  demonstrates 
the  ability  to  fail  gracefully  and  in  a  safe  manner.  Since  such 
systems  are  being  designed  to  be  used  in  contact  with  humans, 
safety  is  a  strong  requisite  which  is  fulfilled  by  the  fact  that  it  is 
both  passive  and  performs  as  a  soft  actuator  (see  [7]). 

PM  systems  can  be  used  to  actuate  an  exoskeleton  frame  worn 
by  humans  to  enhance  strength  and/or  niobility  assistance  for 
humans.  Concepts  developed  from  this  research  can  be  used 
to  help  the  disabled  obtain  better  mobility  enhancement.  Such 
people  have  suffered  from  stoke,  accidents,  or  other  problems  to 
reduce  their  mobility  capability.  The  main  result  will  be  to  aug¬ 
ment  mobility  for  the  veteran  via  the  construction  of  devices  that 
provide  strength  or  mobility  assistance. 

There  have  been  several  investigations  into  applications 
of  PMs  and  their  properties.  PM  research  is  ongoing  at  the 
Human  Sensory  Feedback  (HSF)  Laboratory,  Wright  Patterson 
Air  Force  Base  [6],  [7].  The  HSF  Lab  contains  a  PM  test 
station  that  consists  of  several  PMs,  sensors,  actuators,  and 
instrumentation  to  control  the  PM’s  operation.  Related  work 
is  underway  at  the  University  of  Salford,  U.K.  [1],  [9],  [10]. 
This  work  has  the  aim  of  providing  a  chemo-pneumatic  power 
source  to  drive  PMs.  A  project  involving  rehabilitation  robotics 
is  centered  on  a  wheelchair-mounted  robotic  manipulator  for 
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use  by  the  physically  disabled  and  the  elderly  [1 1].  Efforts  have 
been  directed  toward  the  use  of  low-cost  pneumatic  actuators 
in  the  robotic  manipulator. 

The  Autopod  project  [12]  involves  design  and  testing  of  a 
six-leeged  walking  robot.  The  robot  is  actuated  by  PMs  because 
they  enable  the  robot  to  be  low  cost,  small  and  lightweight,  ro¬ 
bust  and  untethered.  The  BioRobotics  Lab  at  the  University  of 
Washington  has  several  research  projects  that  utilize  PMs.  The 
Powered  Prosthetics  Project  addresses  the  problem  of  amputee 
walking  via  a  PM-powered  prosthesis.  The  Anthroform  Arm 
Project  seeks  to  synthesize  a  robotic  arm  and  controller  based 
completely  on  known  experimental  data  from  human  biome¬ 
chanics  and  neurophysiology.  Both  these  projects  use  the  McK- 
ibben  Artificial  Muscle  [3].  The  lab  is  also  concerned  with  is¬ 
sues  such  as  finite-element  modeling  [13]  and  fatigue  properties 

[14]  of  the  muscle. 

The  Intelligent  Robotics  Lab  at  Vanderbilt  has  developed 
mobile  robots  for  inspection.  The  ROBotic  Inspector  (ROBIN) 

[15] ,  is  intended  to  be  used  for  inspection  of  many  types  of 
man-made  structures  including  bridges,  buildings,  ships,  and 
planes.  Its  motions  are  powered  by  PMs.  A.nother  system, 
the  Intelligent  Soft  Arm  Control  (ISAC)  [16]  is  an  intelligent 
robotic-aid  system  for  the  service  sector  such  as  hospitals  and 
home.  ISAC’s  main  robot  arm  is  called  the  Soft  Arm,  which 
uses  PMs  in  a  manner  resembling  the  movements  of  the  human 

muscle.  „  .  „ 

This  paper  is  arranged  as  follows.  Section  II  contains 

derivations  of  mathematical  models  of  PMs  in  bicep  and 
tricep  configurations.  Section  III  addresses  nonlinear  adaptive 
tracking  of  limbs  actuated  by  PMs  and  presents  stability  results 
for  PM  adaptive  tracking.  Section  IV  presents  simulations 
of  the  closed-loop  adaptive  tracking  behavior  of  limbs  with 
PMs  in  bicep  and  tricep  configurations.  Section  V  presents  a 
discussion  of  the  results,  and  Section  VI  contains  conclusions. 


II.  Dynamic  Modeling  of  Limbs  With  PM 
IN  Bicep  and  Tricep  Positions 

The  dynamic  behavior  of  PMs  has  been  modeled  as  a  combi¬ 
nation  of  a  nonlinear  viscous  friction  and  a  nonlinear  spring  [6], 
[7],  The  PM  is  inflated  (hence,  shortened)  by  opening  a  sole¬ 
noid  which  controls  the  flow  of  pressurized  gas  into  the  rubber 
bladder.  It  is  deflated  by  opening  another  solenoid  venting  the 
contents  of  the  bladder  to  the  atmosphere.  Both  solenoids  are 
linear  i  e  their  degree  of  opening  can  be  controlled  precisely 
for  accurate  control  of  muscle  length.  When  inflated,  the  PM 
shortens,  exerting  a  force  which  is  equal  to  the  product  of  pres¬ 
sure  inside  the  bladder  and  the  surface  area  of  the  bladder.  Since 
the  pressure  inside  the  bladder  is  always  nonnegative,  the  PM 
can  only  exert  force  in  the  direction  of  shortening,  never  in  the 
lengthening  direction.  This  is  to  say  that,  like  human  skeletal 
muscles,  the  PM  exerts  force  when  it  shortens,  but  not  when  it 
lengthens.  The  coefficient  of  viscous  friction  and  spring  coeffi¬ 
cient,  in  addition  to  being  nonlinear,  also  depend  on  whether  the 
PM  is  being  inflated  or  deflated.  The  latter  property  is  because 
deflation  is  easier  than  inflation  due  to  the  air  pressure  inside 
the  bladder. 


Fig.  2.  PM  driving  a  mass. 


Fig.  2  shows  a  pneumatic  muscle  being  inflated  and  lifting 
a  mass.  Let  the  uninflated  (hence,  maximum)  length  of  the  PM 
be  defined  as  :r  =  0.  If  a:(0  is  the  change  in  length  of  the  PM 
from  its  uninflated  length  (hence,  xit)  is  nonpositive  for  all  t), 
the  dynamical  equation  describing  the  system  of  Fig.  2  is 

F  -(-  D(x)x  -f-  K{x)x  =  —Mx  (2.1) 


where  the  coefficients  B{x)  and  K{x)  depend  on  whether  the 
PM  is  being  inflated  or  deflated  and  are  defined  as  (see  [6]  and 
[71) 


Inflation ; 
Deflation : 


f  Bi(a:)  =  0.04x2  ^  ^  3 j.  +  12.6 
\  A'i(x)  =  1.6.'c2  +  10.9x  4-  27.1 
/  BdCx)  =  0.12.'i;2  ^  2A9x  + 14.48 
\  Kh{x}  =  3.6x2  ^  20.7x  +  47.23 


where  x(t)  is  the  PM  change  in  length  in  centimeters. 

The  left-hand  side  of  (2. 1 )  gives  the  sum  of  all  external  forces 
acting  on  the  mass,  as  follows.  The  system  input  F  is  the  upward 
force  exerted  on  the  mass  by  the  PM.  It  is  an  independent  con¬ 
trol  variable  which  can  be  externally  commanded  by  adjusting 
the  inflation  and  deflation  solenoids.  The  force  exerted  by  the 
viscous  friction  action  of  the  PM  is  given  by  -{-B{x)x  (since 
upward  motion  corresponds  to  negative  ir).  The  force  due  to  the 
spring  action  of  the  PM  is  given  by  +K(x)x.  The  right-hand 
side  of  (2.1)  gives  the  force  due  to  the  upward  acceleration  of 
the  mass  (upward  acceleration  corresponds  to  negative  x). 

The  two  basic  configurations  in  which  the  PM  can  be  ar¬ 
ranged  for  use  in  exoskeletons  are  the  bicep-type  (Fig.  3)  and 
tricep-type  (Fig.  4)  configurations.  In  this  paper,  the  control 
problem  for  both  configurations  is  precise  control  of  the  joint 
angle  of  a  limb  which  is  holding  a  mass.  Specifically,  we  wish 
to  actuate  the  PM  by  inflating  and  deflating  it  in  such  a  way  that 
the  joint  angle  follows  a  reference  function  of  time  while  the 
limb  holds  a  mass. 


A.  PM  in  Bicep  Configuration 

Consider  the  limb  configuration  shown  in  Fig.  3,  which  de¬ 
picts  an  arm  lifting  a  mass,  with  the  PM  in  the  position  of  a 
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Fig.  3.  PM  in  bicep  configuration. 


Fig.  4.  PM  in  tricep  configuration. 


bicep.  The  upper  arm  remains  stationary  as  the  PM  expands 
and  contracts,  moving  the  forearm.  The  upper  end  of  the  PM 
and  upper  arm  are  attached  to  a  motionless  reference  point.  The 
mass  M  is  held  at  the  end  of  the  forearm.  The  forearm,  which  is 
considered  massless,  is  attached  to  the  upper  arm  by  a  friction- 
less  joint.  The  PM  is  attached  to  the  forearm  at  point  A,  which 
is  a  distance  a  from  the  joint.  The  distance  from  the  center  of 
mass  of  the  load  to  the  joint  is  L.  The  forearm  is  free  to  ro¬ 
tate  through  an  angle  0,  where  0  =  0°  corresponds  to  the  arm 
being  fully  bent,  i.e.  the  mass  in  the  extreme  upward  position, 
and  0  =  180°  corresponds  to  the  arm  being  fully  straightened, 
i.e.  the  mass  in  the  extreme  downward  position.  For  simplicity, 
we  will  assume  the  PM  force  always  acts  parallel  to  the  forearm. 
This  assumption  is  valid  so  long  as  0  is  not  close  to  either  of  its 
extremes. 

Since  the  upward  force  exerted  by  the  PM  on  the  forearm  at 
point  .4  is  F  -f  B{x)x  +  K{x)x,  the  clockwise  torque  imparted 


to  the  forearm  by  the  PM  is  (F+B(x)x+K(x)x)afim  0.  There¬ 
fore,  the  system  dynamics  are  described  by 

—10  =  (F  +  B(x)d:  +  K{x)x)  a  sin  0  —  MgLsm  0  (2.4) 

where  I  =  ML^  is  the  moment  of  inertia  of  the  mass  about  the 
joint,  g  is  the  acceleration  of  gravity,  and  MgL  sin  0  is  the  coun¬ 
terclockwise  torque  imparted  to  the  forearm  by  gravity.  Then, 
using  r  =  -«(H-  cos  0)  and  x  =  a0  sin  0,  we  can  rewrite  (2.4) 
entirely  in  terms  of  ^  as  follows: 

10  =  —Ba^0(im^0+Ka^Hin0{l+c.oii0)+MgLfim0-Fnn\n0 

(2.5) 

where  B  and  K  are  now  expressed  in  terms  of  0  and  0. 

The  external  input  to  the  system  is  F,  which  is  determined  by 
how  much  the  PM  is  inflated.  Note  that  since  F  is  multiplied  by 
.sin  0  in  (2.5),  the  system  becomes  uncontrollable  zt0  =  0°  and 
at  0  =  180°.  For  this  reason,  joint  angles  should  not  approach 
these  limits.  We  will  see  that  the  tricep  configuration  does  not 
have  this  restriction. 

B.  PM  in  Tricep  Configuration 

Fig.  4  depicts  an  arm  lifting  a  mass  with  the  PM  in  the  position 
of  a  tricep.  The  upper  arm  remains  stationary  as  the  PM  expands 
and  contracts,  moving  the  forearm.  The  lower  end  of  the  PM 
and  upper  arm  are  attached  to  a  motionless  reference  point.  The 
mass  M  is  held  at  the  end  of  the  forearm.  The  forearm,  which 
is  considered  massless,  is  attached  to  the  upper  arm  by  a  fric¬ 
tionless  joint.  Also  at  the  joint  is  a  frictionless  pulley  of  radius 
r,  over  which  a  cable  connecting  the  PM  to  the  foreami  passes. 
The  PM  is  attached  to  the  forearm  at  point  A,  which  is  a  distance 
c  from  the  joint.  The  cable  makes  an  angle  a  =  sin~^  (r/c)  with 
the  forearm.  The  distance  from  the  center  of  mass  of  the  load  to 
the  joint  is  L.  The  forearm  is  free  to  rotate  through  an  angle  0, 
where  ^  =  0°  corresponds  to  the  arm  being  fully  straightened, 
i.e.  the  mass  in  the  extreme  upward  position,  and  0  =  180°  cor¬ 
responds  to  the  arm  being  fully  bent,  i.e.  the  mass  in  the  extreme 
downward  position. 

Since  the  downward  force  exerted  by  the  PM  is  F  -f  B(x)x  -t- 
K (x'jx,  the  clockwise  torque  imparted  to  the  forearm  by  the  PM 
is  (F-i-Z?(.'i;).'j:+if  (3:)3;)csin  a.  Therefore,  the  system  dynamics 
are  described  by 

—10  =  (F  +  B(x)x  -f-  /'('(a:)a;)  csin  A  -  MgL-sin0 
=  (F  +  B(x)x  +  K(x)x)  r  —  MgLsin0  (2.6) 

where  I  =  M is  the  moment  of  inertia  of  the  mass  about  the 
joint,  g  is  the  acceleration  of  gravity,  and  MgL  .sin  0  is  the  coun¬ 
terclockwise  torque  imparted  to  the  forearm  by  gravity.  Then, 
using  x  =  — (l/2)7r7-(l  +  cos0)  and  x  =  (l/2)7n’^sm6>,  we 
can  rewrite  (2.6)  entirely  in  terms  of  0  as  follows: 

..  B7rT~  ’  Kirv^ 

10  = - - — ^.siiiS  H - - — (1 -1-COS0)  -I-  MgLm\0  -  rF 

it  /t 

(2.7) 

where  B  and  K  are  now  expressed  in  terms  of  0  and  0.  Note 
that  the  system  with  PM  in  tricep  position  is  controllable  for  all 
0  because  the  force  exerted  by  the  PM  always  acts  at  an  angle 
a  to  the  forearm  regardless  of  joint  angle. 
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III.  ADAPTIVE  Tracking  for  Limbs  With  PM 
IN  BICEP  AND  Tricep  Position 

The  mass  M  manipulated  by  the  PM  can  be  expected  to  vaiy 
significantly  from  use  to  use.  In  addition,  the  coefficients  D  and 
K  will  vary  with  PM  temperature,  and  from  unit  to  unit.  Also, 
the  physical  distances  r,  a,  and  L  may  vary  from  unit  to  unit. 
Therefore,  the  bicep  (2.5)  and  tricep  (2.7)  models  are  poorly 
known  and  time-varying,  making  nonadaptive  control  methods 
vulnerable  to  failure  when  used  for  tracking  performance  of 
the  PM.  Since  the  nonlinear  functions  of  0  are  known  in  (2.5) 
and  (2.7)  and  only  their  coefficients  are  uncertain,  we  utilize 
a  method  of  nonlinear  adaptive  tracking  based  on  sliding  con¬ 
trol  [19],  [21].  It  uses  a  well-known  result  from  model  reference 
adaptive  control,  which  we  give  without  proof  (see,  e.g.,  [19]). 

Lemma:  Consider  two  signals  e  and  ^  related  by  the  fol¬ 
lowing  dynamic  equation: 

e{t)  =  H(p)  [^:<^^(f)v(<)]  (3.1) 

where  e{t)  is  a  scalar  output  signal,  H{'p)  is  a  strictly  positive 
real  transfer  function,  k  is  an  unknown  constant  with  a  known 
sign,  4{t)  is  a  m  X  1  vector  function  of  time,  and  v(t)  is  a 
measurable  m  x  1  vector.  If  the  vector  (f>  varies  according  to 

^{t)  =  -sgn(&)7ev(t)  (3.2) 


where  k  is  a  positive  constant  and  /i,  «i , . . . ,  ay  are  estimates  of 
the  unknown  parameters  /t,  ai , . . . ,  ay.  With  this  control  law,  we 
have  the  following  result  concerning  the  stability  of  asymptotic 
tracking  of  the  arm  with  PM  in  bicep  position. 

Theorem  I:  Consider  the  PM  in  bicep  position  moving  a 
mass  (Fig.  3).  Assume  the  PM  spring  and  viscous  friction  co¬ 
efficients  are  as  in  (2.2).  If  the  force  F  delivered  by  the  PM  sat¬ 
isfies  (3.6),  then  all  signals  of  the  adaptive  system  are  bounded 
with  liriit_co  e(f )  =  0  provided  the  parameter  estimates  are  ad¬ 
justed  according  to 

h  =  -  'ysj/r  (3.7a) 

2,  =  —  'ysfi,  i  =  (3.7b) 


where  7  is  a  positive  constant. 

Proof:  It  can  be  shown  that  the  tracking  error  from  control 
law  (3.6)  is 

7 

hs  +  ks  =  7iyr  -t-  ^  Tiifi  (3.8) 

<=i 


where  h  =  h-  h  and  a-i  =  Sf  -  o*.  This  can  be  rewritten  as 


s  = 


P+I 


»=i 


(3.9) 


with  7  being  a  positive  constant,  then  e(t)  and  are  globally 
bounded.  Furthermore,  if  v  is  bounded,  then  e(t)  0  as  f 
oc. 

A.  Bicep  Adaptive  Tracking 

Consider  the  problem  of  the  arm  lifting  a  mass  with  PM  in 
bicep  position  as  in  Fig.  3.  If  we  substitute  x  =  -o(l  +  cos  0) 
and  X  =  aB  sin  9  in  (2.5),  we  get  an  equation  in  the  form 

hB+ai9^  sin^  B+a^B^  sin*  B+rizB sin*  B+a.4  sin  B{1  +  cos  Bf 

-EQ.5sin0(l-i-cos5)*-f«6sin0(l-|-cos^)-l-a7sin0  =  -Fsin^ 

or 


This  is  in  the  form  of  (3.1)  with  the  transfer  function  obviously 
being  strictly  positive  real.  Therefore,  we  have  from  the  lemma 
that  all  signals  of  the  adaptive  system  are  bounded. 

Consider  the  Lyapunov  function  candidate 


V  =  hs-  -t-  7"^ 


(3.10) 


where  A  =  ft  -  h.  and  Si  =  a;  -  Oj.  It  is  straightforward  to  show 
that  the  derivative  of  along  the  trajectories  of  the  closed-loop 
system  is  given  by 

V  =  -2ks^.  (3.11) 


hnY2aifi{Bj)  =  -F  sin  6  (3.3) 

1  =  1 

where  ft  =  I /a,  0.1 07  are  parameters  which  depend  on 
the  physical  properties  of  the  system  (i.e.  M,  a,  and  L,  the  co¬ 
efficients  in  (2.2),  etc.),  and  /i, . . . ,  /t  are  known  functions  of 
B  and  B. 

Assume  that  ft,  oi , . . . ,  07  are  unknown  and  it  is  desired  that 
the  PM  angle  6{t)  track  a  known  desired  angle  9d{k).  E)efine  the 
error  e(t)  =  B(i)  -  Bait).  Also,  define  the  signal 

?/■,  (<)  =  yd{t)  -  Aoe(f)  (3.4) 


Therefore,  we  have  s  -+  0  as  f  oc.  It  follows  that 
limt_oo  e(f)  =  0.  Cl 

B.  Tricep  Adaptive  Tracking 

Consider  the  problem  of  the  arm  moving  a  mass  with  PM  in 
tricep  position  as  in  Fig.  4.  If  we  substitute  x  —  -(l/2)7r7-(l  -f 
cos61)  and  x  =  (l/2)7rr0siii61  in  (2.7),  we  get  an  equation  in 
the  form 

hB  +  o,i{9 sin B)  +n2(BsinB)  a3^sin61 -|- 04(1 -1- (:os61) 
-l-aojl  cos 5)*  +  a6(l  +  cos^)  +  ay  sin  B  =  -F  (3.12) 


where  Aq  is  a  positive  constant  and  the  combined  error 
s  =  e  -1-  Aoc. 

Consider  a  control  F  such  that 

7 

-Fsin0  =  ftj/r  —  fcs -F 

i=l 


or 

7 

(3.5)  hB  +  '^aiMB,B)  =  -F  (3.13) 

»=i 

where  ft  =  //r,  ai , . . . ,  ay  are  parameters  which  depend  on  the 
(3  6)  physical  properties  of  the  system  (i.e.  M,  a,  L,  the  coefficients 
in  (2.2),  etc.),  and  fi,...,fy  aie  known  functions  of  B  and  B. 
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Assume  that  h,ai,...,a-  are  unknown  and  it  is  desired  that 
the  PM  angle  9{t)  track  a  known  desired  angle  Define  the 

quantities  e(t)  =  e{t)  -  ?/,.(«).  and  s,  as  above. 

Consider  a  control  F  such  that 

7 

-F  =  hy,-  -  ks  +  ^2^‘fi  (3- 14) 

i=l 

where  A:  is  a  positive  constant  and  h,ai,...,ay  are  estimates 
of  the  unknown  parameters  h,  ai , . . . ,  ay.  Then  we  have  the 
following  result  concerning  the  stability  of  asymptotic  tracking 
of  the  arm  with  PM  in  tricep  position  using  the  control  (3.14); 

Theorem  2:  Consider  the  PM  in  tricep  position  lifting  a  mass 
(Fig.  4).  Assume  the  PM  spring  and  viscous  friction  coefficients 
are  as  in  (2.2).  If  the  force  F  delivered  by  the  PM  satisfies 
(3.14),  then  all  signals  of  the  closed-loop  system  are  bounded 
with  iinu_oo  e(<)  =  0  provided  the  parameter  estimates  are  ad¬ 
justed  according  to 

h=—-ysyr  (3.15a) 

ai=-'fsfi,  i  =  l,...,7  (3.15b) 

where  7  is  a  positive  constant. 

Ptvof:  The  proof  is  similar  to  that  of  Theorem  1 . 

Comment  1:  To  implement  the  control  laws  (3.6)  and  (3. 14), 
it  is  necessary  to  measure  0  and  6.  This  should  be  no  problem 
in  PM  applications,  because  these  are  the  joint  angle  and  its  rate 
of  change,  respectively,  and  are  easily  measured. 

Comment  2:  As  stated  above,  the  bicep  control  F  is  multi¬ 
plied  by  sin  (9,  which  vanishes  as  the  arm  approaches  either  the 
vertical-up  or  vertical-down  position.  Thus,  the  arm  cannot  be 
controlled  in  the  vicinity  of  these  positions.  For  this  reason,  care 
should  be  taken  to  avoid  arm  angles  close  to  vertical-up  or  ver¬ 
tical-down  for  bicep  control.  Tricep  configuration  does  not  have 
this  limitation  due  to  the  fact  that  the  force  is  always  applied  at 
an  angle  a  to  the  forearm,  regardless  of  the  joint  angle. 

Comment  3:  The  assumption  of  coefficients  (2.2)  are  neces¬ 
sary  so  that  the  plant  parameters  are  constants.  This  assumption 
is  equivalent  to  stipulating  that  the  PM  is  not  allowed  to  deflate. 
This  may  be  the  case  if,  e.g.,  the  task  is  to  lift  a  mass.  If  the  PM 
were  inflating  and  deflating,  the  PM  spring  and  viscous  friction 
coefficients  would  switch  between  (2.2)  and  (2.3).  Therefore, 
the  plant  parameters  would  be  time  varying  and  the  adaptive 
tracking  problem  would  be  considerably  more  difficult.  The  as¬ 
sumptions  are  technically  necessary  to  prove  the  theorems,  but 
do  not  appear  to  be  necessary  for  asymptotic  tracking  in  some 
simulations  in  which  both  inflation  and  deflation  are  involved. 
Of  course,  the  theorems  also  apply  to  the  case  where  only  coef¬ 
ficients  (2.3)  are  assumed.  In  this  case,  the  PM  is  not  allowed  to 
inflate  but  only  deflate,  which  might  be  the  case  when  the  task 
is  to  lower  the  mass. 

IV.  Simulation  Results 

The  systems  of  (2.5)  and  (2.7)  were  simulated  using  a  fourth- 
order  Runge-Kutta  algorithm  with  a  step  size  of  0.0 1  s.  The  con¬ 
trol  for  the  bicep  configuration  is  given  by  (3.6)  and  (3.7).  The 


Fig.  5.  Bicep  adaptive  tracking — ^7  =  3  x  lO"*  and  A  =  A;  =  1. 

control  for  the  tricep  configuration  is  given  by  (3.14)  and  (3. 1 5). 
The  results  of  these  simulations  are  given  in  the  following. 

A.  Bicep  Simulation 

Assume  a  configuration  as  in  Fig.  3.  Let  L  =  0.5  m,  a  = 
0.025  m,  M  =  50  kg,  and  g  =  9.807  m/s^.  Since  a  =  0,025  m, 
the  full  travel  of  the  forearm  from  9  =  180°  (arm  fully  straight¬ 
ened)  to  9  =  0°  (arm  fully  bent  )  corresponds  to  a  maximum 
change  in  length  of  the  PM  of  0.05  m.  This  corresponds  approx¬ 
imately  to  the  actual  capability  of  the  PM. 

The  desired  trajectoiy  for  the  joint  angle  between  0  and  15  s 
is 

BAt)  =  [60®  -I-  62.5®  (siii(27r/if)  +  .05sm(27r/2f))] 

18.0  .. . 
(4.1) 

with  /i  =  0.01  and  /2  =0.1  Hz.  Therefore,  Oj{t)  is  a  sum 
of  two  sinusoids  with  initial  condition  0<i(O)  =  60®.  This  tra¬ 
jectory  spans  monotonically  increasing  joint  angles  from  60°  to 
approximately  110°  and  corresponds  to  the  arm  lifting  a  mass 
from  a  lower  level  to  a  higher  level  along  the  prescribed  trajec¬ 
tory. 

The  input  to  the  PM  is  given  by  (3.6)  and  (3.7)  with  7  =  3  x 
10“*,  Ao  =  1,  and  k  =  1.  The  initial  parameter  guesses  are  zero, 
and  the  initial  joint  angle  is  42°.  The  desired  and  actual  joint 
angles  are  shown  in  Fig.  5.  It  is  seen  that  the  arm  asymptotically 
tracks  0d{t)  after  the  initial  adaptation  stage  (approximately  4  s)! 

B.  Tricep  Simulation 

Assume  a  configuration  as  in  Fig.  4.  Let  L  =  0.5  m,  r  = 
(O.OS/tt)  m,  M  =  50  kg,  and  g  =  9.087  m/s^.  Since  r  — 
(O.OS/tt)  m,  the  full  travel  of  the  forearm  from  0  =  180®  (  arm 
fully  bent)  to  0  =  0®  (arm  fully  straightened)  corresponds  to  a 
maximum  change  in  length  of  the  PM  of  0.05  m. 

The  desired  trajectory  for  the  joint  angle  is  again  as  in  (4.1). 
The  input  to  the  PM  is  given  by  (3.14)  and  (3.15)  with  7  = 

1  X  10®,  A()  =  1,  and  fc  =  1.  The  initial  parameter  guesses  are 
zero,  and  the  initial  joint  angle  is  42°.  The  desired  and  actual 
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Fig.  6.  Tricep  adaptive  tracking— 7  =  1  x  10’  and  A  =  At  =  1. 

joint  angles  are  shown  in  Fig.  6.  Again,  we  have  asymptotic 
tracking  except  in  the  initial  adaptation  stage. 

V.  Discussion 

The  simulations  of  PM  in  bicep  and  tricep  configurations 
have  been  designed  to  closely  conform  to  use  in  PM-actuated 
exoskeletons.  In  an  exoskeleton,  there  are  no  rigid  rods  for 
forearm  and  upper  arm,  but  the  exoskeleton  may  possess 
some  form  of  rigidity,  i.e.,  a  rigid  enclosure  for  a  human  limb. 
Exoskeleton  PMs  are  arranged  in  configurations  very  similar  to 
human  skeletal  muscles,  i.e.  agonist/antagonist  or  bicep/tricep 
pairs.  The  bicep  and  tricep  results  in  this  paper  apply  to  PMs 
used  anywhere  in  an  exoskeleton  (amis,  legs,  etc.),  as  long 
as  they  are  arranged  in  bicep  or  tricep  configurations.  An 
exoskeletoti  PM  in  the  tricep  configuration  must  have  a  path 
over  which  the  PM  cable  passes  to  attach  to  the  limb  past  the 
joint.  This  track  has  been  modeled  as  a  frictionless  pulley  in 
this  study. 

In  typical  exoskeleton  applications,  the  mass  actuated  by  the 
PM,  or  the  moment  of  inertia  of  the  moving  joint,  will  vary 
significantly  due  to  changing  joint  angles.  For  instance,  when 
mwing  a  mass  from  one  point  to  another,  the  arm  may  bend, 
changing  the  load  to  the  PM,  which  nevertheless  must  actuate 
the  limb  to  follow  a  desired  reference  trajectory.  This  situa¬ 
tion  arises  in  robotics  as  well.  Also,  the  nonlinear  spring  and 
nonlinear  viscous  damping  coefficients  are  poorly  known  and 
change  with  time.  This  is  because  with  use,  the  PM  heats  up, 
changing  these  coefficients.  In  addition,  physical  properties  of 
the  exosiceleton,  i.e.,  ami  lengths,  distances  to  attach  points,  etc., 
may  be  poorly  known. 

Therefore,  adaptive  control  methods  have  been  applied  to 
this  problem,  since  fixed  controllers  are  less  robust  to  param¬ 
eter  changes  than  adaptive  ones.  The  simulations  in  Section  IV 
were  also  carried  out  with  a  PID  controller  designed  to  give 
good  perforaiance  with  M  =  50  kg.  If  the  mass  remains  in  the 
vicinity  of  this  value,  the  PID  gives  good  results.  However,  if 
the  mass  changes  significantly,  the  fixed  PID  cannot  stabilize  the 
system.  With  the  ad^tive  controllers  given  in  Section  III,  M  can 


undergo  a  threefold  change  while  retaining  adequate  tracking. 
However,  the  fixed  PID  is  much  less  tolerant  to  changes  in  M, 
failing  to  stabilize  the  system  for  M  >  80  kg. 

Finally,  we  note  that  in  real  applications  of  PMs,  they  will 
most  probably  be  arranged  in  agonist/antagonist  pairs,  as  in 
[4].  Therefore,  there  will  be  a  bicep/tricep  pair  rather  than  a 
single  bicep  or  tricep  acting  alone.  This  would  increase  joint 
impedance  and  result  in  a  more  stable  joint  angle  control 
problem.  The  present  paper  is  intended  to  study  the  action  of 
individual  muscles  only,  without  introducing  agonist/antagonist 
interaction. 


VI.  Conclusion 

Dynamic  models  for  pneumatic  muscles  in  bicep  and  tricep 
configurations  actuating  a  mass  have  been  derived.  These  con¬ 
figurations  are  very  similar  to  exoskeleton  applications  in  which 
PM’s  are  used  to  increase  strength  and  mobility  in  humans. 
The  models  are  second-order  and  nonlinear  in  the  joint  angle. 
Their  form  makes  them  amenable  to  nonlinear  adaptive  control 
techniques,  since  the  nonlinear  functions  of  the  joint  angle  are 
known,  with  only  physical  constants  of  the  system  being  un¬ 
known.  Simulations  of  closed-loop  adaptive  tracking  of  limbs 
moving  masses  show  that  adaptive  control  techniques  are  supe¬ 
rior  to  fixed  methods,  i.e.,  fixed  PID  controllers. 
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